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DESCRIPTION 

Methods for Phenotvpe Cr eation 
From Multiple Gene Populations 

Cross Reference to Related A pplication 

This is a continuation-in-part application of copend- 
ing application Serial No. 513,957, filed April 24, 1990 
which is a continuation-in-part of Serial No, 353,235, 
5 filed May 16, 1989, and Serial No. 353,241, filed May 17, 
1989, the disclosures of which are hereby incorporated by 
reference. 

Field of the Invention 

The present invention relates to methods for randomly 
10 combining populations of nucleotide sequences and select- 
ing those combinations coding for a desired predetermined 
phenotype . 

Background of the Invention 

The production of genetic variants, including vari- 

15 ants of both polypeptides and organisms such as bacteria 
and phage, has been a goal in the work of many individuals 
involved in recombinant DNA technologies. For example, 
researchers have beneficially relied upon random genetic 
recombination in the past for the production of new and 

20 useful microorganisms. Genetic recombination includes a 
variety of processes that produce new linkage relation- 
ships of genes or parts of genes. Genetic recombination 
is often subdivided into general genetic recombination, 
which takes place between homologous chromosomes, more or 

the existence of specific sites in one or more molecules 
and which includes interactions of viral qenomes and 



SUBSTITUTE SHEET 



WO 91/16427 



PCT/US91/02910 



2 

eukaryotes, and less well defined instances of recombina- 
tion that appear to require neither extensive homology nor 
special sites. Variable gene expression can also result 
in production of various combinations of polypeptides, the 
5 immune system being one example of such protein 
combination. 

The immune system of a mammal is one of the most 
versatile biological systems; probably greater than 1.0 x 
10 7 antibody specificities can be produced. Indeed, a 

10 great deal of contemporary biological and medical research 
is directed toward tapping this repertoire. During the 
last decade, furthermore, there has been a dramatic 
increase in the ability to harness the output of the 
immune system. The development of the hybridoma method- 

15 ology by Kohler and Milstein has made it possible to 
produce monoclonal antibodies, i.e. , a composition of 
antibody molecules of single epitope specificity, from the 
repertoire of antibodies induced during an immune 
response. Monoclonal antibodies have been generated in 

20 the past from hybridomas, generated by fusing antibody- 
secreting lymphocytes with an immortal cell line, such as 
myeloma . 

Although standard hybridoma technology has been 
extremely valuable, the screening of fused cells to iden- 

25 tify hybridomas expressing useful antibody molecules is 
labor intensive, time consuming and expensive. Moreover, 
the standard technology yields rodent antibody molecules 
that have two clear disadvantages. The first is that 
subtle variations in certain human antigenic systems, such 

3 0 as major histocompatibility proteins, are not easily dis- 
tinguished by non-primate antibodies. Therefore, rodent 



3 5 monoclonal antibodies are not capable of efficiently 
surveying the entire antibody response induced by a 
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are at least 5-10,000 different B-cell clones capable of 
generating unique antibodies to a small relatively rigid 
immunogens, such as, for example dinitrophenol. Further, 
because of the process of somatic mutation during the 
5 generation of antibody diversity, essentially an unlimited 
number of unique antibody molecules may be generated. In 
contrast to this vast potential for different antibodies, 
current hybridoma methodologies typically yield only a few 
hundred different monoclonal antibodies per fusion. A 
10 second major drawback in hybridoma technology is that 
rodent antibodies are highly immunogenic in humans, and 
can preclude their continued use in patients for 
diagnostic or therapeutic purposes. 

One alternative is to produce human cells that 
15 express antibody. Unfortunately, it is quite difficult to 
identify and produce pure human monoclonal antibodies. 
Standard methods used to immortalize antibody-producing 
cells are less than satisfactory. One approach that 
circumvents the need for human hybridoma cells has been to 
2 0 use recombinant DNA technology to express fusion antibody 
proteins. These molecules have amino terminal variable 
domains of the light and heavy chains derived from a 
specific rodent monoclonal antibody and the carboxy 
terminal constant region domains derived from a human 
2 5 antibody. The use of human constant regions diminishes 
the human anti-globulin immune response,, avoiding the 
stimulation of anti-isotypic antibody-producing B cells. 
However, the rodent-derived variable region framework 
domains still elicit a response that is more severe than 
a variable domain response directed against a pure human 
- nt- 1 bnri v 



30 



jomains, some researchers have taken a human antibody and 
35 replaced the hypervariable regions (CDRs) with 

hvrervariable reaion- ^or a -odonf antibody specif ir ^o- 
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affinity for antigen comparable to the parent rodent 
antibody, the process of grafting all rodent CDRs into a 
human immunoglobulin gene is technically challenging. 

Aside from repertoire specificity and immunogenicity , 
5 other drawbacks in producing monoclonal antibodies with 
the hybridoma methodology include genetic instability and 
low production capacity of hybridoma cultures. One means 
by which the art has attempted to overcome these latter 
two problems has been to clone the immunoglobulin- 
10 producing genes from a particular hybridoma of interest 
into a procaryotic expression system. See, for example, 
Robinson et al., PCT Publication No. WO 89/0099; Winter et 
al., European Patent Publication No. 0239400; Reading, 
U.S. Patent No. 4,714,681; and Cabilly et al., European 
15 Patent Publication No. 0125023. 

The immunologic repertoire of vertebrates has 
recently been found to contain genes coding for 
immunoglobulins having catalytic activity. Tramontano et 
al., Sci. . 234:1566-1570 (1986); Pollack et al., Sci. , 
20 234:1570-1573 (1986); Janda et al., Sci. . 244:437-440 
(1989) . The presence of, or the ability to induce the 
repertoire to produce, antibody molecules capable of a 
catalyzing chemical reaction, i.e., acting like enzymes, 
had previously been postulated almost 20 years ago by W. 

2 5 P. Jencks in Catalysis in Chemistry and Enzvmology , 

McGraw-Hill, N.Y. (1969). 

It is believed that one reason the art failed to 
isolate catalytic antibodies from the immunological 
repertoire earlier, and its failure to isolate many to 

3 0 date even after their actual discovery, is the inability 

ne iiyoriaoma technique, towards the production high 
35 affinity antibodies inherently designed for participation 

; n the nrorpn r - nf rp'.itr? 1. i 7. t i cn .. " "-rr: " : . ; : * •* 
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In an attempt to enhance the designed recombination 
of desired DNA sequences or the desired combination of 
otherwise randomly generated polypeptides, including the 
identification and production of pure human monoclonal 
5 antibodies, we have pursued alternative approaches for the 
production and screening of such nucleotide sequences and 
polypeptides . 

Summary of th e Invention 

The present invention is directed to methods for pro- 

10 ducing biological agents having a desired novel phenotype 
wherein this phenotype results from expression of a 
particular combined nucleotide sequence and wherein said 
phenotype can be used to identify the biological agents 
having the particular combined nucleotide sequence and 

15 distinguish them from biological agents having other 
combined nucleotide sequences. The desired phenotype is 
typically a phenotype which is not normally expressed by 
the parent nucleotide sequences. In one embodiments these 
methods comprise first replicating at least portions of 

20 two parent nucleotide sequences. The replicating step 
yields a population of diverse replicas of parent nucleo- 
tide sequences. In one embodiment, each parent nucleotide 
sequence initially comprises a population (or family) of 
diverse nucleotide sequences which is replicated to give 

25 a population of diverse replicas. Alternatively, a popu- 
lation of diverse replicas is generated by replicating a 
parent nucleotide sequence under conditions which allow 
mutations to occur which generates diversity from one 
parent nucleotide sequence and results in a population of 



DNA moiecuies. Where the parent, nucleotide sequences com- 
prise one DNA molecule, after replication, the resulting 

r^pul at i cr- of diverno ^rl iczz dor 4 -oi r -~r " -r r — 
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diverse replicas are then brought together, preferably in 
a random manner, to produce combined nucleotide sequences 
wherein each combined nucleotide sequence comprises one 
member of each population of diverse replicas. The parent 
5 nucleotide sequences may be suitably replicated and 
brought together according to the various methods 
described herein for replication and recombination of 
nucleotide sequences and generation of combinatorial 
libraries. The combined nucleotide sequences are 
10 expressed in biological agents. Such biological agents 
may comprise a host cell, or alternatively, a plasmid, 
bacteriophage or virus, or nucleic acid vector, and such 
suitable means for expression are described herein. In 
one embodiment < expression may constitute the mere exist- 
15 ence of th enucleotide sequences in the same biological 
agent. Then, the biological agents which express the 
desired phenotype are identified. If desired, the pheno- 
type is used to distinguish those biological agents 
expressing the particular combined nucleotide sequence 
20 from biological agents expressing other combined nucleo- 
tide sequences. The desired phenotype may comprise a 
polypeptide, more than one polypeptide, or a multimeric 
polypeptide, the expression of which is detectable . 
Alternatively, the phenotype may comprise synthesis of one 

2 5 or more RNA molecules. Optionally, either the polypep- 

tides or RNA may exhibit enzymatic activity or receptor 
activity; or the DNA or RNA may simply act as a target for 
interaction with other molecules. 

The present invention provides novel methods for the 

3 0 cloning of cells having novel phenotypes. These methods 



starting populations ot nucleotide sequences which can be 
35 recombined to form a library of clones containing nucleo- 

i~ -I r . r~ pi rri i (~i t-> r-» rs r* f y r^-*** ■ ■■ ■ , / - V ■ ■■■■(" V -, ri v ■ ; , . - ,~, *- > , , , *~ : ' ! " ' r . £t r ■- 
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novel phenotypes, that is, cells having a new and desired 
combination of expressed polypeptides. These methods can 
also be used for the production of new combinations of 
polypeptides, including the polypeptides utilized in the 

5 formation of biologically competent immunoglobulin mole- 
cules, in accordance with the latter object of the 
invention, these methods can be used to screen a larger 
portion of the immunological repertoire for receptors 
having a preselected activity than has heretofore been 
10 possible, thereby overcoming the before-mentioned 
inadequacies of the hybridoma technique. 

In another embodiment, the present invention 
contemplates a gene library comprising an isolated admix- 
ture of at least about 10 3 , preferably at least about 10 

15 and more preferably at least 10 s V H -and/or V L -coding DNA 
homologs, a plurality of which share a conserved antigenic 
determinant. Preferably, the homologs are present in a 
medium suitable for in vitro manipulation, such as water, 
phosphate buffered saline and the like, which maintains 

20 the biological activity of the homologs. 

in one embodiment, at least two starting populations 
of DNA sequence-containing vectors are physically combined 
by any of several techniques, including those described 
herein, to form a library of clones containing DNA 

25 sequences from each of the parent populations. Alterna- 
tively there may be more than two gene families and the 
vectors produced thereby may contain a random assortment 
of one member of each gene family to create the identifi- 
able characteristic. These vectors can then be 

30 transferred to desired host cells to create in vivo novel 

,_v^r^+-<on^ ~* phpnntvpir -harsrtpri ?t i re < r, the host 



recombination, ana site-specit ic recombination by metnoas 
3 5 including intergrase-related proteins, flp recombinase- 

.^^.... r ^ ,.„ rnTbinatinn , the cre-lox fwst.er- ~'' b^rterio- 
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In a still further embodiment, the present invention 
contemplates vectors for use in the methods which 
comprise, in addition to random DNA sequences from the 
starting gene family populations, DNA sequences which 
5 facilitate the region-specific, random recombination 
together of at least one gene from each starting gene 
family population. Sequences enabling the recombination 
of these vector* include the use of functional flp recom- 
bination sequences, functional loxp recombination 

10 sequences, at sequences recognized by integrase-related 
proteins from lambdoid bacteriophages, and terminal repeat 
sequences recognized by transposases. Thus, the present 
invention also includes methods for the combinatorial 
generation of phenotypes, including a method of producing 

15 a nucleic acid vector encoding two or more desired genes 
each from a family of genes, said genes being capable of 
producing a characteristic that can be used to identify 
the vector encoding said genes from other vectors encoding 
other members of the families of genes, which method 

2 0 comprises: 

a) randomly inserting into vectors one member of a 
first family of genes and one member from one or more 
other families of genes so that a population of vectors 
are created wherein each vector may contain one of the 

2 5 genes from said first gene family and one of the genes 
from each of said other gene families; 

b) identifying within said population of vectors a 
vector capable of detectably producing a desired charac- 
teristic resulting from the inclusion of one gene from 

30 said first gene family and one gene from each of said 



T ■' 



members from saia gene ramilies. 
3 5 Suitable vectors for use according to the methods of 

V- r -- -v o o T"y ""' " ' T r~* * +- ~ : ■ • ■■■ * - ■ ■ 1 ? ' 3 t' T ■ " VT * ■■ """" ] " i '• " ' *" ' y ; " 
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expressing the vectors comprise either eukaryotic cells or 
prokaryotic cells. Preferred eukaryotic cells include 
mammalian cells. In one preferred aspect, the vectors 
comprise lambda bacteriophage and host cells comprise E. 
5 Coli. 

Preferably, the genes are combined in vivo. 
Various suitable methods may be used for the identi- 
fication of a particular vector within the recombinant 
vector population. These methods include (a) the inter- 

10 action of sequence-specific nucleic acids with genes from 
the individual families which were combined: (b) the 
hybridization of nucleic acid probes with genes from the 
gene families; (c) the expression of one or both genes 
from the gene families as an RNA molecule; and (d) the 

15 expression of one or both genes as an identifiable protein 
molecule. Optionally, such an identifiable protein 
molecule may contain a binding site for another molecule, 
an epitope recognized by an antibody, or an immune 
molecule binding site for an epitope. In a preferred 

20 identification method, both genes express an RNA and/or 
polypeptide and said RNAs and/or polypeptides physically 
interact with a host to create an identifiable character- 
istic. Both genes may express polypeptides that physic- 
ally interact to form a neo-epitope recognized by an 

2 5 immune molecule or polypeptides that physically interact 
to form a binding site for another molecule. Optionally 
those polypeptides are derived from antibody genes such 
that the interaction of both polypeptides forms an antigen 
binding site. 

In another preferred aspect, the vectors produced 



30 



expressed rro::i tnen own promoter . 
35 i n a still further embodiment, the present invention 

mntor.ri atos th* creation of combinations of rore 
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recombination. Such in vitro recombination could be 
carried out using specific recombination target sequences 
and specific recombinases (like flp recombinase) , or by 
using homologous sequences shared by both nucleotide 
5 sequence populations to facilitate homologous 
recombination . 

One method to accomplish a form of homologous 
recombination in vitro is by using in vitro nucleic acid 
amplification methods such as the polymerase chain reac- 
10 tion (PCR) . If both of two populations of DNA sequences 
share a region of homology, then it is possible during the 
PCR for base-pairing to occur between single stranded 
nucleic acid molecules from both populations of nucleotide 
sequences. If such base pairing creates a "primer- 
15 template complex" that can be used by a polymerase to 
begin synthesis of complementary strands, then a fusion 
product is created which will contain sequences from both 
nucleotide sequence populations (See Figure 21 here) . If 
the shared region of homology is present on most or all of 
20 the two nucleotide sequence populations, then most or all 
of the nucleotide sequences can participate in such 
recombination. Thus, a combinatorial population of fusion 
nucleotide sequences can be produced, and subsequently 
inserted into a single expression vector for expression of 
25 the nucleotide sequence from both sequence families. Such 
a combinatorial population of expressed sequences can then 
be screened for new phenotypes that would not be present 
if the sequences from only one population of nucleotide 
sequences were expressed, and would be present only with 
3 0 expression of particular combinations comprising a nucleo- 

nucleotide sequence family and the other subunit of the 
35 dimer is encoded by the other nucleotide sequence family. 

- i . . n. • v - , ^. . ... t . ,. ■ . - - - •• ■ 
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10 



of nucleotide sequences which may be combined to give a 
diversity of phenotypes, from which a desired phenotype 
may be selected. Such diversity may be generated starting 
with a single DNA molecule which is treated to create 
diversity, such as by mutagenesis or by starting with a 
family of nucleotide sequences (or genes) or a 
combinatorial library. 

For example.- one may start with a plasmid containing 
antibody sequences coding for both a light chain and a 
heavy chain which has been isolated from a known 
monoclonal -antibody producing cell line. The nucleotide 
sequences coding for the light chain and the heavy chain 
may be individually amplified (using a method such as PCR) 
under conditions that mutated sequences are generated to 
15 create a population of mutated sequences. The individual 
populations of mutated sequences may be used to make com- 
binatorial libraries which are then used to create novel 
phenotypes. Alternatively, these individual populations 
of mutated sequences may be combined using techniques such 
20 as fusion polynucleotide amplification (for example) 
fusion PCR (as described herein) and used to generate 
novel phenotypes. These novel phenotypes may include 
antibodies having enhanced antigen binding 
characteristics . 

2 5 According to another aspect of the present invention, 

one or more genetically distinct phage may be lytically 
replicated, conditions which are somewhat mutagenic, to 
generate a population (s) of diverse phage. Phage having 
phenotypes distinct from the originals may be generated by 

3 0 cleavage such as by a restriction endonuclease , followed 



parental phage may be generated combinatonaiiy . 
35 in another embodiment, the methods are utilized to 
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gene library containing a substantial portion of the V H 
gene repertoire of a vertebrate is synthesized. In pre- 
ferred embodiments, the V H -coding gene library contains at 
least about 10 3 and more preferably at least about 10 4 and 
5 more preferably at least about 10 5 different V H -coding 
nucleic acid strands referred to herein as V H -coding DNA 
homologs. 

The gene library can be synthesized by various 
methods, depending on the starting material. Where the 
10 starting material is a plurality of V H -coding genes, the 
repertoire is subjected to two distinct primer extension 
reactions. The first primer extension reaction uses a 
first polynucleotide synthesis primer capable of initiat- 
ing the first reaction by hybridizing to a nucleotide 
15 sequence conserved (shared by a plurality of genes) within 
the repertoire. The first primer extension reaction 
produces a plurality of different V H -coding homolog 
complements (nucleic acid strands complementary to the 
genes in the repertoire) . The second primer extension 
20 reaction produces, using the complements as templates, a 
plurality of different V H -coding DNA homologs. The second 
primer extension reaction uses a second polynucleotide 
synthesis primer that is capable to initiating the second 
reaction by hybridizing to a nucleotide sequence conserved 
25 among a plurality of V H -coding gene complements. 

Where the starting material is a plurality of 
complements of different V H -coding genes provided by a 
method other than the first primer extension reaction, the 
repertoire is subjected to the above-discussed second 
3 0 primer extension reaction. That is, where the starting 

. r.c. , Lilt- complements are subjected to a primer extension 
35 reaction using a polynucleotide synthesis primer that 

hvbr i d i ze? p nluralirv of fho diffprer. 4- T * ^od^rr: '■: r. ■ 
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V-coding genes and their complements are present in the 
starting material, both approaches can be used in 
combination. 

A V H -coding DNA homolog, i.e. , a gene coding for a 
5 receptor capable of binding the preselected ligand, is 
then segregated from the library to produce the isolated 
gene. This may be accomplished by operatively linking for 
expression a plurality of the different V H -coding DNA 
homologs of the library to an expression vector. The V„- 

10 expression vectors so produced are introduced into a popu- 
lation of compatible host cells, i.e., cells capable to 
expressing a gene operatively linked for expression to the 
vector. The trans formants are cultured under conditions 
for expressing the receptor coded for by the V H -coding DNA 

15 homolog. The transf ormants are cloned and the clones are 
screened for expression of a receptor that binds the pre- 
selected ligand. Any of the suitable methods well known 
in the art for detecting the binding of a ligand to a 
receptor can be used. A transf ormant expressing the 

20 desired activity is then segregated from the population to 
produce the isolated gene. 

A receptor having a preselected activity produced by 
a method of the present invention, preferably a V H or F v as 
described herein, is also contemplated. 

25 The present invention also encompasses products 

produced by the methods of the invention, such as the 
biological agents produced thereby, also the expression 
products of these mathods such as polypeptides and nucleic 
acids, vectors produced and kits comprising any of the 

3 0 products of the claimed methods. 



Figure i illustrates a schematic diagram o; tne 
immunoglobulin molecule showing the principal structural 
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biologically active (ligand binding) portion of that 
region, and a gene coding for that polypeptide, are 
produced by the methods of the present invention. 
Sequences L03, L35, L47 and L48 could not be classified 
5 into any predefined subgroups. 

Figure 2 A is a diagrammatic sketch of an H chain of 
human IgG (IgGl subclass). Numbering is from the N- 
terminus on the left to the C-terminus on the right. Note 
the presence of four domains, each containing an intra- 
10 chain disulfide bond (S-S) spanning approximately 60 amino 
acid residues. The symbol CHO stands for carbohydrate. 
The V region of the heavy (H) chain (V H ) resembles V L in 
having three hypervariable CDR (not shown) . 

Figure 2B is a diagrammatic sketch of a human K chain 
15 (Panel 1) . Numbering is from the N -terminus on the left 
to the C-terminus on the right. Note the intrachain 
disulfide bond (S-S) spanning about the same number of 
amino acid residues in the V L and C L domains. Panel 2 
shows the locations of the complementarily-determining 
20 regions (CDR) in the V L domain. Segments outside the CDR 
are the framework segments (FR) . 

Figure 3 depicts the amino acid sequence of the V H 
regions of 19 mouse monoclonal antibodies with specificity 
for phosphorylcholine. The designation HP indicates that 
25 the protein is the product of a hybridoma. The remainder 
are myeloma proteins. (From Gearhart et al., Nature , 
291:29, 1981.) 

Figure 4 illustrates the results obtained from PCR 
amplification of mRNA obtained from the spleen of a mouse 
3 0 immunized with FITC. Lanes R17-R24 correspond to ampli- 
fication reactions witr thp urnrrup ^ ' ^rimers f *~> — q t r Mp 

... jnu ^ prinei ;aDie ana R'j are trie arcpi^- 

35 fication controls; control Z involves the amplification of 

V for a nlasmid t PT..R? ) and R° represrnts the annl i f i oat i or. 
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from the constant regions of spleen mRNA using primers 11 

and 13 (Table 1) . 

Figure 5 depicts nucleotide sequences of clones from 
the cDNA library of the PCR amplified V H regions in Lambda 
5 ZAP vector. The N-terminal 110 bases are listed here and 
the underlined nucleotides represent CDR1 (complementary 

determining region) . 

Figures 6A and 6B depict the sequence of the 
synthetic DNA insert inserted into Lambda ZAP vector to 

10 produce Lambda Zap II V H (6A) and Lambda Zap V L (6B) 
expression vectors. The various features required for 
this vector to express the V H and V L -coding DNA homologs 
include the Shine-Dalgarno ribosome binding site, a leader 
sequence to direct the expressed protein to the periplasm 

15 as described by Mouva et al., J. Biol. — Chem . , 255:27, 

1980, and various restriction enzyme sites used to opera- 
tively link the V H and V L homologs to the expression 
vector. The V H expression-vector sequence also contains a 
short nucleic acid sequence that codes for amino acids 

20 typically found in variable regions heavy chain (V H 
Backbone) . This V H Backbone is just upstream and in the 
proper reading as the V H DNA homologs that are operatively 
linked into the Xho I and Spe I. The V L DNA homologs are 
operatively linked into the V L sequence (6B) at the Nco I 

25 and Spe I restriction enzyme sites and thus the V„ Backbone 
region is deleted when the V L DNA homologs are operatively 

linked into the V L vector. 

Figure 7 depicts the major features of the bacterial 
expression vector Lambda Zap II V H (V H -expression vector) 
are shown. The synthetic DNA sequence from Figure 6 is 

r.hown at the toe alona with the T, polymerase Dromoter from 



30 



_nsert.eu uie ,Mio . aiu ~>pe ■ ; ll x c l . oi , enzyn.t.: 

3 5 sites. The V DNA are inserted into the Xho I and Spe I 
Mte -md the read through transcription produces the 
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decapeptide epitope (tag) that is located just 3 ! of the 

cloning sites. 

Figure 8 depicts the major features of the bacterial 
expression vector Lambda Zap II V L (V L expression vector) 
5 are shown. The synthetic sequence shown in Figure 6B is 
shown at the top along with the T 3 polymerase promoter from 
Lambda Zap II vector. The orientation of the insert in 
Lambda Zap vector II is shown. The V L DNA homologs are 
inserted into the phagemid that is produced by the in vivo 
10 excision protocol described by Short et al., Nucleic Acids 
Res, . 16:7583-7600, 1988. The V L DNA homologs are inserted 
into the Nco I and Spe I cloning sites of the Phagemid. 

Figure 9 depicts a modified bacterial expression 
vector Lambda Zap II V L II. This vector is constructed by 
15 inserting this synthetic DNA sequence, 

TGAATTCTAAACTAGTCGCCAAGGAGACAGTCATAATGAA 

TCGAACTTAAGATTTGATCAGCGGTTCCTCTGTCAGTATTACTT 
ATACCTATTGCCTACGGCAGCCGCTGGATTGTTATTACTCGCTG 

TATGGATAACGGATGCCGTCGGCGACCTAACAATAATGAGCGAC 
2 0 CCCAACCAGCCATGGCCGAGCTCGTCAGTTCTAGAGTTAAGCGGCCG 

GGGTTGGTCGGTACCGGCTCGAGCAGTCAAGATCTCAATTCGCCGGCAGCT 

into Lambda Zap II vector that has been digested with the 
restriction enzymes Sac I and Xho I. This sequence 
contains the Shine-Dalgarno sequence (ribosome binding 
25 site) , the leader sequence to direct the expressed protein 
to the periplasm and the appropriate nucleic acid sequence 
to allow the V L DNA homologs to be operatively linked into 
the SacI and Xbal restriction enzyme sites provided by 
this vector. 

30 Figure 10 depicts the sequence of the synthetic DNA 

segment inserted into Lambda Zap II vector to produce the 



separately and in combination. The various essential 
components of these vectors are shown. The light chain 



SUBSTITUTE SHEET 



WO 91/16427 



PCT/US91/02910 



1 



n 



10 



expression vector to produce a combinatorial vector con- 
taining both V H and V L operatively linked for expression 
to the same promoter. 

Figure 12 depicts the labelled proteins iromuno- 
precipitated from E. coli containing a V H and a V L DNA 
homolog are shown. In lane 1, the background proteins 
immunoprecipitated from E. coli that do not contain a V H or 
V, DNA homolog are shown. Lane 2 contains the V H protein 
immunoprecipitated from E. coli containing only a V H DNA 
homolog. In lanes 3 and 4, the commigration of a V H 
protein a V L protein immunoprecipitated from E. — coli 
containing both a V H and a V L DNA homolog is shown. In 
lane 5 the presence of V H protein and V L protein expressed 
from the V u and V. DNA homologs is demonstrated by the two 
15 distinguishable protein species. Lane 5 contains the 
background proteins immunoprecipitated by anti-E. — coli 
antibodies present in mouse ascites fluid. 

Figure 13 depicts the transition state analogue 
(formula 1) which induces antibodies for hydrolyzing 
2 0 carboxamide substrate (formula 2). The compound of 
formula 1 containing a glutaryl spacer and a N- 
hydroxysuccinimide-1 inker appendage is the form used to 
couple the hapten (formula 1) to protein carriers KLH and 
BSA, while the compound of formula 3 is the inhibitor. 
25 The phosphonamidate functionality is a mimic of the 
stereoelectronic features of the transition state for 
hydrolysis of the amide bond. 

Figure 14 illustrates the PCR amplification of Fd and 
kappa regions from the spleen mRNA of a mouse immunized 
with NPN. Amplification was performed as described in 
rv^npi p i-7 n^-ina RNA cDNA hybrids obtained by the reverse 
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sequences v raoie ~, . udiieb n-to :eprese;u t ^-~i^ 
35 heavy chain amplification reactions with one of each of 
^ 0 0 i-,h<- nrimers 'primers 2-9, Table 1) and the unique 
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cations with the 5' primers (primers 3-6, and 12, respect- 
ively, Table 2) are shown in lanes F9-F13. A band of 7 00 
bps is seen in all lanes indicating the successful 
amplification of Fd and k regions. 
5 Figure 15 depicts the screening of phage libraries 

for antigen binding is depicted according to Example 17C. 
Duplicate plaque lifts of Fab (filters A,B), heavy chain 
(filters E,F) and light chain (filters G f H) expression 

10 with NPN at a density of approximately 30 , 000 plaques per 
plate. Filters C and D illustrate the duplicate secondary 
screening of a cored positive from a primary filter A 
(arrows) as discussed in the text. 

Screening employed standard plaque lift methods. XL1 

15 Blue cells infected with phage were incubated on 150mm 
plates for 4 hours at 37*0, protein expression induced by 
overlay with nitrocellulose filters soaked in lOmM isopro- 
pyl thiogalactoside (IPTG) and the plates incubated at 25° 
for 8 hours. Duplicate filters were obtained during a 

20 second incubation employing the same conditions. Filters 
were then blocked in a solution of 1% BSA in PBS for 1 
hour before incubation with rocking at 25* for 1 hour with 
a solution of 125 I-labelled BSA conjugated to NPN (2 x 10 6 
cpm ml" 1 ; BSA concentration at 01 M; approximately 15 NPN 

2 5 per BSA molecule) in 1% BSA/PBS. Background was reduced 

by pre-centrifugation of stock radiolabelled BSA solution 
at 100,000 g for 15 minutes and pre-incubation of solu- 
tions with plaque lifts from plates containing bacteria 
infected with a phage having no insert. After labeling, 

3 0 filters were washed repeatedly with PBS/0.05% Tween 20 

before development of autoradiographs overnight. 



ir, were exposed to ^I-BSA-NPN in the presence of increasing 
concentrations of the inhibitor NPN. 
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In this study a number of phages correlated with NPN 
binding as in Figure 15 were spotted (about 100 particles 
per spot) directly onto a bacterial lawns. The plate was 
then overlaid with an IPTG-soaked filter and incubated for 
19 hours at 25°. The filter were then blocked in 1% BSA 
in PBS prior to incubation in 125 I-BSA-NPN as described 
previously in Figure 15 except with the inclusion of vary- 
ing amounts of NPN in the labeling solution. Other 
conditions and procedures were as in Figure 15. The 
results for a phage of moderate affinity are shown in 
duplicate in the figure. Similar results were obtained 
for four other phages with some differences in the 
effective inhibitor concentration ranges. 

Figure 17 depicts the characterization of an antigen 
15 binding protein is illustrated according to Example 17D. 
The concentrated partially purified bacterial supernate of 
an NPN-binding clone was separated by gel filtration and 
aliquots from each fraction applied to microtitre plates 
coated with BSA-NPN. Addition of either anti-decapeptide 
20 ( ) or anti-kappa chain antibodies conjugated with alka- 
line phosphatase was followed by color development. The 
arrow indicates the position of elution of a known Fab 
fragment. The results show that antigen binding is a 
property of 50 kD protein containing both heavy and light 

25 chains. 

Single plaques of two-NPN-positive clones (Figure 15) 
were picked and the plasmid containing the heavy and light 
chain inserts excised. 500 ml cultures in L-broth were 
inoculated with 3 ml of a saturated culture containing the 
excised plasmids and incubated for 4 hours at 37 °C. 
Proteins synthesis was induced by the addition of IPTG to 
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35 50 Mi aliquots from the eluted fractions were assayed by 
ELTSA . 
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For ELISA analysis, microtitre plates were coated 
with BSA-NPN at 1 ug/ml, 50 Ml samples mixed with 50 Ml 
PBS-Tween 20 (0.05%)-BSA (0.1%) added and the plates 
incubated for 2 hours at 25 B . After washing with PBS- 
5 Tween 20-BSA, 50 \xl of appropriate concentrations of a 
rabbit anti-decapeptide antibody (20) and a goat anti- 
mouse kappa light chain (Southern Biotech) antibody 
conjugated with alkaline phosphatase were added and 
incubated for 2 hours at 25'. After further washing, 50 
10 Ml of p-nitrophenyl phosphate (lmg/ml in 0.1M Tris pH 9.5 
containing 50 mM MgCl 2 ) were added and the plates incubated 
for 15-30 minutes before reading the OD at 405nm. 

Figure 18A depicts the major features of the 
bacterial expression vector HCFLP containing a V H DNA 
15 homolog and a flp recombination site. 

Figure 18B depicts the major features of the 
bacterial expression vector LCFLP containing a V L DNA 
homolog and a flp recombination site properly oriented for 
recombination with the HCFLP vector. 
20 Figure 19 depicts a diagrammatic sketch of bacterial 

coinfection with HCFLP and LCFLP vectors for the produc- 
tion of recombinant expression vectors containing V L and V H 
DNA homologs. 

Figure 20 depicts an outline showing arm selection 
25 for heavy and light chain recombinant vector products 
using flp recombinase in conjunction with selection based 
on the inclusion of genes having amber mutations. 

Figure 21 shows an outline of a method of phenotype 
creation using the fusion PCR process described herein. 
3 0 Figure 22 illustrates human fusion PCR inside 

primers. The heavy chain C H 1' inside primer sequence is 



■» z^s v "' J "i f""i M •. k -! , i "' "■ r\ c c - t *■ m . * 

; ' ;, 7^ „. - '. -, • *. . ....... i. - •• ... .a. * . i. i -. ! ■ - '•■ I v . . . x - . j.. *••■ * 

3 5 molecule, but the complementary PCR product strands. 
Boxed nucleotides represent regions where the C H l f primer 
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mRNA or where the V L primer hybridizes to the 5 f end of V L 
framework- 1 on human kappa light chain cDNA. Underlined 
sequences indicate the two stop condons. The italicized 
amino acid and nucleotides indicate changes in sequence 
5 from the original pelB leader sequence. The mouse fusion- 
PGR internal primers overlap in a similar manner. 

Figure 23 illustrates an ethidium bromide stained 
agarose gel. After PCR amplification from human cloned 
DNA of heavy chain alone (HC) , light chain alone (LC) , and 
10 the heavy/ light dicistronic DNA molecule (H/L) , DNA sam- 
ples were electrophoresed. The expected sizes of the HC, 
LC, and H/L products visualized on the gel were approxi- 
mately 730, 690, and 1,390 base pairs, respectively. 

Figures 24A and 24B illustrate the major features of 
15 the bacterial expression vector Lambda ZAP II Modified V H 
(Modified ImmunoZAP H) (V H -expression vector) (IZ H) . The 
amino acids encoded by the synthetic DNA sequence from 
Figure 24A is shown along with the T 3 polymerase promoter 
from Lambda ZAP II. The orientation of the insert in 
Lambda ZAP II is as presented. The insert was modified by 
the elimination of the Sac I site between the T 3 polymerase 
and Not I site and by the change of amino acids at the 5 1 
end of the heavy chain from QVKL to QVQL (alysine residue 
was changed to a glutamine residue) . The V H and V L DNA 
2 5 homologs were inserted into the Xho I and Xba I cloning 
sites of the phagemid as described in Figure 2 6 and shown 
in Figure 24B. The modifications were made to create a 
fusion-PCR library from hybridoma RNA, to overcome 
decreased efficiency of secretion of positively charged 
amino acids in the amino terminus of the protein. Inouye 
«t al.. Proc. Natl. Acad. Sci . , USA, 85:7685-7689 U988) , 



20 



30 



: icjureb ana ±on uiustrdte ui«j sequences a; -ne 

3 5 synthetic DNAs inserted into Lambda ZAP to produce Lambda 
7av IT V <TmmunoZAP m f25A) and Lambda Zao V ( ImTnunoZAP 
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required for these vectors to express the V H and V L -coding 
DNA homologs include the Shine-Dalgarno ribosome binding 
site, a leader sequence to direct the expressed protein to 
the periplasm as described by Mouva et al., J. — Biol. 
5 Chem . . 255:27, 1980, and various restriction enzyme sites 
used to operatively link the V H and V L homologs to the 
expression vector. The V H expression-vector sequence also 
contains a short nucleic acid sequence that codes for 
amino acids typically found in variable regions of the 
10 heavy chain (V H Backbone) . This V H Backbone is just 
upstream and in the proper reading frame as the V H DNA 
homologs that are operatively linked into the Xho I and 
Spe I restriction sites. The V L DNA homologs are opera- 
tively linked into the V L sequence (25B) at the Sac I and 
15 Xba I restriction enzyme sites. 

Figure 26 illustrates the major features of the 
bacterial expression vector Lambda Zap II V H (ImmunoZAP H) 
(V H - expression vector) . The amino acids encoded by the 
synthetic DNA sequence from Figure 25A is shown at the top 
20 along with the T 3 polymerase promoter from Lambda Zap II. 
The orientation of the insert in Lambda Zap II is as pre- 
sented. The V H DNA homologs were inserted into the 
phagemid that is produced by the in vivo excision protocol 
described by Short et al., Nucleic Acids Res. , 16:7583- 

2 5 7 600, 1988. The V H DNA homologs were inserted into the 

Xho I and Spe I restriction enzyme sites. The read 
through transcription produces the decapeptide epitope 
(tag) that is located just 3' of the cloning sites. 

Figure 27 illustrates the major features of the 

3 0 bacterial expression vector Lambda Zap II V L (ImmunoZAP L) 

(V, expression vector) . The amino acids encoded by the 



... 1 <: Tit.;; ".. ... -'..j , . 1 •■ . "•> t ' .... ., i.iiJ v- • ' 

3 5 presented. The V L DNA homologs are inserted into the Sac 
I and Xba I cloning sites of the phagemid as described in 
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Figure 28 illustrated en autoradiogram shoving 
signals obtained fro. hu M n phage clones. Approximately 
100 laifcda phage were spotted onto ksU ^wns, creating 
plagues that were overlaid with nitrocellulose filters 
previously SO axed in 10 « isopropylbeta-D-thiogalacto- 
Pyranoside (XPTC, to induce Fab expression. Following 
overnight incubation, the ""- were reacted with X- 
tetanus toxoid probe. After washing, the flit 

. . .. „„i™n on the right represents 

exposed x-u A-Lay i.-Lj-ui. 

the parental clones that were selected for, a combina- 
torial library. Mullinax et al. , Proc_Natl^Acad^_Sc^ 
USA, 87:8095-8099 (1990). The column on the left repre- 
sents clones that were generated by ^^J^ 
combinatorial lambda clone DNA with the V H and C L outside 
primers, C.1- V L inside primers, followed by reclonxng 
in the modified ImmunoZAP H vector. Clone 7G1 is a nega 
tive control which expresses a Fab that does not react 
with tetanus toxoid. Clones 10C1 and 6C12 both produce 
Fabs that react with tetanus toxoid. IZ H is the modified 
heavy chain ImmunoZAP H vector without an insert. 
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Detailed DgscrlBtijSE of the Invention 
A . no-Hnitions 

As used herein, the following terms have the 
following menings unless expressly stated to the contrary: 
jj^cleotide: a monomeric unit of DNA or RNA consist- 
ing of a sugar moiety (pentose), a phosphate, and a nitro- 
genous heterocyclic base. The base is linked to the sugar 
mo iety via the glycosidic carbon (!' carbon of the 
pentose) and that combination of base and sugar is a 

when the nucleoside contains a phosphate 
30 nucleoside. When tne nutieuiiu 

. t • r t- K ~. r~ r> r> t- O t; O 1 T* "! S 



adenine (A) with 
35 guanine (G) in a 



thymine (T) , or of 
double stranded DNA 



to sine v - / ••' lLI - 
molecule. In RNA, 
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Nucleic Acid; a polymer of nucleotides, either 
single or double stranded. 

Gene: a nucleic acid whose nucleotide sequence codes 
for an RNA or polypeptide. A gene can be either RNA or 
DNA. 

Complementary Bases; nucleotides that normally pair 
up when DNA or RNA adopts a double stranded configuration. 

Complementary Nucleotide Sequence ; a sequence of 
nucleotides in a single-stranded molecule of dna or RNA 
that is sufficiently complementary to that on another 
single strand to specifically hybridize to it with 
consequent hydrogen bonding. 

Conserved ; a nucleotide sequence is conserved with 
respect to a preselected (reference) sequence if it non- 
15 randomly hybridizes to an exact complement of the 
preselected sequence. 

Hybridization; the pairing of substantially 

complementary nucleotide sequences (strands of nucleic 
acid) to form a duplex or heteroduplex by the establish- 
ment of hydrogen bonds between complementary base pairs. 
It is a specific, i i e_ i . non-random, interaction between two 
complementary polynucleotides that can be competitively 
inhibited. 

Nucleotide Analog; a purine or pyrimidine nucleotide 
that differs structurally from A, T, G, c, or U, but is 
sufficiently similar to substitute for the normal nucleo- 
tide in a nucleic acid molecule. 

DNA qomolog ; is a nucleic acid having a preselected 
conserved nucleotide sequence and a sequence coding for a 
receptor capable of binding a preselected ligand. 

Receptor : A receptor is a molecule, such as a 

Drotein. al 
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a- :orms ie used nerem to refer to immunoglobulin 
molecules and immunologically active portions of immuno- 

alobulin pnii or • . 
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antibody combining site or paratope. Exemplary antibody 
molecules are intact immunoglobulin molecules, substan- 
tially intact immunoglobulin molecules and portions of an 
immunoglobulin molecule, including those portions known in 
the art as Fab, Fab 1 , F(ab') 2 and F(v) . 

Antibody combining Site : An antibody combining site 
is that structural portion of an antibody molecule com- 
prised of a heavy and light chain variable and hypervari- 
able regions that specifically binds (immunoreacts with) 
an antigen. The term immunoreact in its various forms 
means specific binding between an antigenic determinant- 
containing molecule and a molecule containing an antibody 
combining site such as a whole antibody molecule or a 

portion thereof. 
!5 Monoclonal Antibody : The phrase monoclonal antibody 

in its various grammatical forms refers to a population of 
antibody molecules that contains only one species of anti- 
body combining site capable of immunoreacting with a 
particular antigen. A monoclonal antibody thus typically 
20 displays a single binding affinity for any antigen with 
which it immunoreacts. A monoclonal antibody may there- 
fore contain an antibody molecule having a plurality of 
antibody combining sites, each immunospecif ic for a 
different antigen, e.g., a bispecific monoclonal antibody. 
25 Upstream : In the direction opposite to the direction 

of DNA transcription, and therefore going from 5' to 3 1 on 
the non-coding strand, or 3 1 to 5 1 on the mRNA. 

Downstream : Further along a DNA sequence in the 
direction of sequence transcription or read out, that is 
traveling in a 3'- to 5 1 -direction along the non-coding 
strand of the DNA or 5'- to 3 1 -direction alonq the RNA 
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Stop Codon : Any of three codons that do not code for 
an amino acid, but instead cause termination of protein 
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synthesis. They are UAG, UAA and UGA. Also referred to 
as a nonsense or termination codon. 

Leader Polypeptide : A short length of amino acid 
sequence at the amino end of a protein, which carries or 
5 directs the protein through the inner membrane and so 
ensures its eventual secretion into the periplasmic space 
and perhaps beyond. The leader sequence peptide is 
commonly removed before the protein becomes active. 

Reading Frame : Particular sequence of contiguous 
10 nucleotide triplets (codons) employed in translation. The 
reading frame depends on the location of the translation 
initiation codon. 

Inside Primer : An inside primer is a polynucleotide 
that has a priming region located at the 3 f terminus of 
15 the primer which typically consists of 15 to 30 nucleotide 
bases. The 3* terminal-priming portion is capable of 
acting as a primer to catalyze nucleic acid synthesis. 
The 5 1 -terminal priming portion comprises a non-priming 
portion. 

20 Outside Primer : An outside primer comprises a 3 1 - 

terminal priming portion and a portion that may define an 
endonuclease restriction site which is typically located 
in a 5 '-terminal non-priming portion of the outside 
primer. 

25 Fusion Polynucleotide Amplification : refers to in 

vitro techniques of generating a multiple complementary 
copies of a nucleic acid template which comprises nucleo- 
tide sequences which have been randomly combined to give 
a combined nucleic sequence. These techniques typically 

3 0 employ complementary primers which hybridize to the 
template and are extended in a primer extension reaction. 



35 combined nucleotide sequence using fusion PCR is further 
described herein. 
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+-v,^ f^rm "vector" refers to 
Vector: As used herein, the term vec^ 

a ^acid molecule capable to trans^ -ween 
different genetic environments another nucleic a 

it has been operatively linked. One type of pre 
which it has been p nucleic acid molecule 

ferred vector is an *^' X ££Lm. Other suitable 
capable of ^-<^^ 1 ^ t1 ^ and phage, 
vectors include plasmid and cosmxd 

especially bateriophage such as lambda Prefer 

«.* o raM ble of autonomous replication anu/u. -h- 

::;: ble - trr 3 «^r r :rr: Y s 

are operatively linked are referr 
"expression vectors". 

B. mttete engineers typically 

Until this invention, genetic g ,„ llv , or 

dealt with the expression of a single gene or family (or 
population, of gene,, one at a tiee. The expression of a 
family of genes in a vector is generally referred to as a 
family oi y library will normally 

"gene library." Each member of the y 
contain a different gene or DNA sequence I—- ^ the 
vector portion of such a vector-gene fusion is typically 

Identical from member to member. (Maniatis et , 
identical j-j-^u , . hrarv mav often 

suora) individual members within the library may 
SSES&) * . 4fied before screening to 

be, and typically are, amplified bet 

• i=r-o a desired member. Amplification 
i identify and isolate a desirea bacte rial 
occurs so that each library member grows as a > acte ^ 
colony (for plasmid libraries) or phage plagues f or 
bacteriophage libraries, such as lambda) . These amplified 
Dacteriopi y "clones," since each 

me mbers are usually referred to as dor , 
„ ro ,onv or plague is made up of many identical host 



::; diff^: 

, ts vector-gene specifically hybridizes with a nucleic 
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RNA species that can be identified, for example by nucleic 
acid hybridization. The RNA species may, furthermore, be 
translated into a protein, typically by the host cell, 
that may be identified, for example, by reactivity with an 
antibody probe. Alternatively, the protein may be recog- 
nized because it binds a substrate, or catalyzes a 
reaction, or allows the host cell to survive under 
selective conditions, and so on. 

Described herein are libraries in which two or more 
families (or populations) of genes are expressed in a 
vector or a host cell in such a way that the gene combi- 
nations are randomly represented and subsequently detected 
on the basis of some property or characteristic in the 
event that a particular combination of one member from a 
first gene family and one gene from a one or more other 
gene families are combined in a vector host cell. For 
example, in the general case if there are »i» members of 
the gene family "A" and »j« members of the gene family 
"B", there will be (i) x (j) combinations of selected gene 
members A and B in the randomly created vector-gene 
population. If there are three gene families, A, B, and 
C, and a vector is made containing one member from each of 
the three gene families, the total number of combinations 
of genes will be the product of the number of A genes 
times the number of B genes times the number of c genes. 
Thus, methods are provided wherein at least two genes may 
randomly be combined, preferably on the same vector 
molecule, having been identified within a population of 
vectors containing other combinations of different genes 
from the same two or more gene families. This approach 
may be broadly accomplished by means other than recombi- 
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nucleotide sequences from each gene family. The recombi- 
nation of at least two separate library populations to 
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common restriction site or site-directed recombination 
systems, is also contemplated. 

Thus, in addition to the above-described methods, the 
invention also provides for vectors having characteristics 
5 and sequences useful for the preparation of combinatorial 
vectors encoding random DNA sequences from two or more 
gene families. Such vectors include plasmids and phage 
containing common restriction sites or sequences enabling 
the in vivo recombination of said DNA sequences from said 

10 gene families. 

The flp site-specific recombination of SL_ cervisiae 
has been described in Cox, Chapter 13 in "Genetic Recombi- 
nation," eds. R. Kucherlapati and G. Smith (American 
Society for Microbiology 1988) . Within a sixty-five bp 

15 region identified as the recombination site and designated 
FRT (flp recombination target) , there are several promi- 
nent structural features. The most important are a set of 
three bp repeats. The second and third repeats are separ- 
ated by one bp and are in the same orientation. The first 

2 0 repeat is inverted with respect to the other two and is 
separated from the second repeat by an eight bp spacer. 
The first repeat also has a one bp mismatch relative to 
the first two. Deletion analysis has demonstrated that 
the third repeat is unnecessary for recombination in 

25 vitro , although it may have a slight effect on the reac- 
tion in vivo . Additional deletions indicate that most, 
but not all, of the first and second repeats (those 
flanking the spacer) are required. While deletion of 
three bp from the distal ends of one or both of these 

30 repeats has no detectable effect on the reaction, further 
deletion leads to a gradual reduction in site function, 



2b therefore relatively small (approximately 28 bp including 
the spacer and the proximal 10 bp of each flanking 
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intermediate, or minimal FRT sequences can be utilized to 
accomplish flp-mediated site-specific recombination. 

The lambda phage attachment site is responsible for 
integration of lambda into the host chromosome. It also 
5 acts as a hot spot of recombination and lytic crosses 
between wild lambda chromosomes. As in lambda, in PI 
phage a site-specific cross over site, loxP acts as a hot 
spot of recombination. This site is recognized by the PI 
ere protein, a known site-specific protein. The site- 
10 specific recombination system is responsible for the rare 
integration of PI into the host chromosome. The cre-lox 
system of bacteriaphage PI is also useful for the site- 
specific recombination contemplated by the invention 
described and claimed herein. 
15 A transposon can jump from one vector to another 

vector or from a vector to a bacterial chromosome. 
Different transposons having different inverted repeat 
sequences and carrying, for example, different drug- 
resistance genes, can be used to carry out the desired 

2 0 random combination of genes as described herein either in 

vivo or in vitro . The transposon may, but need not, also 
contain a sequence encoding the transposase enzyme which 
catalyzes the "hop." Various suitable transposon systems 
have been described in the literature. (See, Mobile DNA , 
25 Douglas E. Berg and Martha M. Howe, eds. , American Society 
for Microbiology, Washington, D.C., 1989). One suitable 
transposon system is the gamma-delta transposon system 
which has been isolated from E. Coli. 

Thus, in addition to restriction digestion and 

3 0 ligation, use of flp type recombination systems, and 

homologous recombination, a transposon system can also be 

■ r-. * ..... ■ - . .. •..•^.Jl.l^J * i. JlJCu • . ^il,\^i.H ^ iit i 1 L- 

3d chain expression and cloning region with transposon 
terminal sequences. A library constructed in this light 
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clones from the heavy chain library. The light chain 
inserts between the terminal sequences would hop from the 
light chain lambda phage vector into other DNA sequences 
in the presence of transposase activity. Selection for 
hopping into the heavy chain clone can be accomplished by 
placing a selectable marker within the light chain, posi- 
tioned between the transposon hopping sequences, 
subsequently, phage recovered from the co-infected culture 
is plated with a strain enabling selection for the heavy 
chain vector and for the light chain marker gene. Because 
this second plating is performed under conditions of a 
high cell to phage ratio, only one lambda phage will 
typically be introduced into each cell. The lambda phage 
should grow only if the phage contains genes from both the 
15 heavy and light chain clones; most efficiently resulting 
from the transposon hop. If the hop occurs in the essen- 
tial genes of the heavy chain clone, the phage will not 
grow. Only phage containing the transposon in the proper 
position within the heavy chain will grow. A collection 
20 of these clones comprises a library of combinatorial heavy 
and light chain antibody clones. 

According to one aspect of the present invention, 
fusion PCR is used to generate two PCR-amplif ied DNA 
fragments, each of which have one of their ends modified 
25 by directed mispriming so that those ends share regions of 
complementarity, i.e., cohesive termini. When the two 
fragments are mixed, denatured and reannealed in a PCR 
cycle, the cohesive termini on two strands hybridize to 
form an "overlapping" DNA duplex that is internally 
3 0 primed. The subsequent PCR cycle primer-extends the non- 
overlapcina regions to form a hybride DNA molecule that is 



'\J «- 



35 4,965,188, and at least in several texts including "PCR 
Technology: Principles and Applications for DNA 



SUBSTITUTE SHEET 



WO 91/16427 



PCI7US91/02910 



32 

(1989) ; and "PCR Protocols" A Guide to Methods and 
Applications", Innis et al. f eds. f Academic Press, San 
Diego, California (1990). 

Thus, in one aspect of the present invention, fusion 
5 PCR is used to produce a library of dicistronic DNA mole- 
cules ocntaining upstream and downstream cistrons wherein 
first and second PCR amplification products are produced 
using respective first and second PCR primer pairs. The 
first PCR primer pair comprises a first polypeptide 
10 outside primer and a first polypeptide inside primer. 
Similarly, the second PCR primer pair comprises a second 
polypeptide outside primer and a second polypeptide inside 
primer. The first and second polypeptide inside primers 
contain complementary 5 1 -terminal sequences that allow 
15 their DNA complements to hybridize and form an internally- 
primed duplex having 3 1 -overhanging termini. The 
internally-primed duplex is then subjected to primer 
extension reaction conditions to produce a double 
stranded, dicistronic DNA having substantially blunt or 
20 blunt ends. The dicistronic DNA is then PCR amplified 
using the outside primers as a PCR primer pair. 

The dicistronic DNA molecule comprises two amino acid 
residue-coding sequences on the same strand separated by 
at least one stop codon and at least one signal sequence 
25 necessary for translation of the downstream cistron, such 
as a translation initiation codon, ribosome binding site, 
and the like. Thus, the upstream and downstream cistrons 
of the dicistronic DNA molecule are operatively linked by 
a cistronic bridge. The cistronic bridge comprises the 
3 0 genetic elements necessary to terminate translation of the 
upstream cistron and initiate translation of the down- 



3 5 cistron. The cistronic bridge coding strand preferably 
also encodes a ribosome binding site for the dowstream 
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stop codon(s). Typically, the coding strand of the 
cistronic bridge will also encode a leader polypeptide 
segment in the same translational reading frame as the 
downstream cistron. When present, the nucleotide base 
5 sequence encoding the leader usually begins with an 
initiation codon located within an operative distance, 
i.e. is operatively linked, to the ribosome binding site. 

The following discussion illustrates the use of 
fusion PCR to isolate a pair of V H and V L genes from the 

10 immunoglobulin gene repertoire. This discussion is not to 
be taken as limiting, but rather as illustrating an appli- 
cation of creating a novel phenotype by combining one 
member from each of two or more families of genes. The 
illustrated method can be used with other families of 

15 conserved genes which each for one unit of a dimeric 
receptor, whether obtained directly from a natural source, 
such naive or in vivo immunized cells, or from cells or 
one or more genes that have been treated or mutagenized in 
vitro. Generally, the method, combines the following 

20 elements: 

1. Producing V H and V L gene repertoires. 

2. Preparing sets of outside and inside polynucleo- 
tide primers for cloning polynucleotide segments 
containing immunoglobulin V H and V L region genes. 

25 3. Preparing a library containing a plurality of 

different dicistronic DNA molecules, each containing a V H 
and a V L gene from the respective repertoires. 

4. Expressing the dicistronic DNA molecules in 

suitable host cells. 

5. Screening the polypeptides expressed by the 

dicistronic DNA molecules for the preselected activity, 



30 



. ;ul t^uiu L^enLiiieu o 2 :ne screening process. 
3 5 The present invention also provides a novel 

method for screening variants of a parental clone or 
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nucleotide sequences that, when expressed together, create 
a phenotype, then such nucleotide sequences can be altered 
to create populations of variants of such nucleotide 
sequences. If the two variant populations are coexpressed 
5 in a random fashion (that is with no correlation between 
the specific alterations made in the two different nucleo- 
tide sequences) , then a combinatorial collection of such 
nucleotide sequence variants has been created. Such com- 
binatorial collections may be screened for the presence 
10 of phenotypes that are unlike the parental clone or 
clones. Generally, the method combines the following 
elements: 

1. Replicating a clone containing a nucleotide 
sequence under conditions that allow mutations to occur. 
15 2. Replicating a second clone containing a second 

nucleotide sequence under conditions that allow mutations 
to occur. 

3. Randomly combining and co-expressing the two 
mutated populations of nucleotide sequences. 
20 4. Screening clones containing combinations of 

mutated nucleotide sequences for phenotypes that were not 
present in either parent clone. 

Alternatively, the methods combine the following 
elements : 

25 1. Replicating at least portions of two nucleotide 

sequences contained within a single clone under conditions 
that allow mutations to occur in either nucleotide 
sequence. 

2. Allowing recombination events between the two 
3 0 nucleotide sequence populations to reassociate mutant 
nucleotide sequences to form new pairs of the two 



3 5 nucleotide sequences for phenotypes that were not present 
in the parent clone or in the mutant replicas of the 
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Variant 1 
Variant 2 
Variant 3 
Variant 4 
Variant 5 



For example, assume a parent clone containing two 
nucleotide sequences A and B is replicated under mutating 
conditions such that variant clones are formed: 
Parent: A/B 

Al/B 
A/Bl 
A2/B1 
A/B2 
A3/B 

10 However, within this mutated population, the combinations 
A1/B2, A2/B, A2/B2, A3/B1, and A3/B2, do not occur. If 
the mutant population (including some non-mutated parent 
clones) is allowed to recombine sequences A and B and 
their variants, then combinations such as A1/B2 , A2/B etc. 

15 can be created- Such new combinations may express a 
desired phenotype that was not present in the parental or 
the variant population. 

In one aspect, the present invention is related to 
methods for tapping the immunological repertoire by 

20 isolating from V H -coding and V L -coding gene repertoires 
genes coding for a heterodimeric antibody receptor capable 
of binding a preselected ligand. Generally, the method 
combines the following elements: 

1. Isolating nucleic acids containing a substantial 

2 5 portion of the immunological repertoire. 

2. Preparing polynucleotide primers for cloning 
polynucleotide segments containing immunoglobulin V H and V L 
region genes. 

3. Preparing a gene library containing a plurality 

3 0 of different V H and V L genes from the repertoire. 

^^^c-c. i rrr v ^ nr ^ v no! vnenti der. in * 



-j. Screening :ne expresseu po^ypepnaeb lui wiit- 
35 preselected activity, and segregating a V H - and V L -coding 

,, r ..^ n .^^ n ,H^ idersti fi<?d by the ?rreenina process. 
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In one aspect, the expressed phenotype produced by 
the methods by the present invention comprises a multi- 
meric polypeptide product (i.e. a heterodimer , etc.) which 
assumes a conformation having a binding site specific for, 
5 as evidenced by its ability to be competitively inhibited, 
a preselected or predetermined ligand such as an antigen, 
enzymatic substrate and the like. In one embodiment, the 
multimeric polypeptide is an antibody that forms an anti- 
gen binding site which specifically binds to .a preselected 
10 antigen to form an immunoreaction product (complex) having 
a sufficiently strong binding between the antigen and the 
binding site for the immunoreaction product to be iso- 
lated. The antibody typically has an affinity or avidity 
is generally greater than 10 5 -M" 1 . 
15 In another embodiment, a multimeric polypeptide 

produced according to the present invention is capable of 
binding a substrate and catalyzes the formation of a 
product from the substrate. While the topology of the 
ligand binding site of a catalying multimeric polypeptide 
20 is probably more important for its preselected activity 
than its affinity (association constant or pKa) for the 
substrate, the useful catalytic multimeric polypeptides 
typically have an association constant for the preselected 
substrate generally greater than 10 3 M" 1 , more usually 
25 greater than 10 5 M~ 1 or 10 6 M" 1 and preferably greater than 
10 7 M" 1 . 

Preferably the multimeric polypeptide produced 
according to the present invention is heterodimeric and is 
therefore normally comprised of two different polypeptide 
30 chains, which together assume a conformation having a 
binding affinity, or association constant for the pre- 



3 5 preferred aspect, one or both of the different polypeptide 
chains is derived from the variable region of the light 
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peptides comprising the light (V L ) and heavy (V H ) variable 
regions are employed together for binding the preselected 
ligand. 

A V or V produced by the methods of the subject 
■* ■ 
5 invention can be active in monomeric as well as multimenc 

forms , either homomeric or heteromeric , preferably hetero- 

dimeric. A V H and v L ligand binding polypeptide produced 

by the present invention can be advantageously combined in 

a heterodimer (antibody molecule) to modulate the activity 

10 of either or to produce an activity unique to the hetero- 
dimer. The individual ligand binding polypeptides will be 
referred to as V H and V L and the heterodimer will be 
referred to as an antibody molecule. 

However, it should be understood that a V H binding 

15 polypeptide may contain in addition to the V H , substan- 
tially all or a portion of the heavy chain constant 
region. A V L binding polypeptide may contain, in addition 
to the V L , substantially all or a portion of the light 
chain constant region. A heterodimer comprised of a V H 

20 binding polypeptide containing a portion of the heavy 
chain constant region and a V L binding containing substan- 
tially all of the light chain constant region is termed a 
Fab fragment. The production of a Fab can be advantageous 
in some situations because the additional constant region 

25 sequences contained in a Fab as compared to a F v could 
stabilize the V H and V L interaction. Such stabilization 
could cause the Fab to have higher affinity for antigen. 
In addition the Fab is more commonly used in the art and 
thus there are more commercial antibodies available to 

30 specifically recognize a Fab. 

The individual V and V. polypeptides may be produced 



peptides *n- generally nave :ewei uaai. a.uin^ 

35 residues, more usually fewer than about 120 amino acid 
residues, while normally having greater than 60 amino acid 
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residues, more usually greater than about 100 amino acid 
residues. Preferably, the V H will be from about 110 to 
about 125 amino acid residues in length while V L will from 
about 95 to about 115 amino acid residues in length. 
5 The amino acid residue sequences of the polypeptides 

will vary widely, depending upon the particular idiotype 
involved. Usually, there will be at least two cysteines 
separated by from about 60 to 75 amino acid residues and 
joined by a disulfide bond. The polypeptides produced by 
10 the subject invention will normally be substantial copies 
of idiotypes of the variable regions of the heavy and/or 
light chains of immunoglobulins, but in some situations a 
polypeptide may contain random mutations in amino acid 
residue sequences in order to advantageously improve the 

15 desired activity. 

In some situations, it is desireable to provide for 
covalent cross linking of the V H and V L polypeptides, which 
can be accomplished by providing cysteine resides at the 
carboxyl termini. The polypeptide will normally be pre- 

20 pared free of the immunoglobulin constant regions, however 
a small portion of the J region may be included as a 
result of the advantageous selection of DNA synthesis 
primers. The D region will normally be included in the 
transcript of the V H . 

2 5 In other situations, it is desirable to provide a 

peptide linker to connect the V L and the V H to form a 
single-chain antigen-binding protein comprised of a V H and 
a V L . This single-chain antigen-binding protein would be 
synthesized as a single protein chain. Such a single- 

3 0 chain antigen binding proteins have been described by Bird 

et al., Science . 242:423-426 (1988). The design of 



...... • ■fciU L. ~ LU.. - . .-. t , ...•:!. ; it^i^ne . ■ ... 

nucleic acid sequences contained in the expression vector. 
The nucleic acid sequences coding for the peptide linker 

• ' : : ► • -twee:: * ho V in-l 
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restriction endonuclease sites used to operatively link 
the V u and V, DNA homologs to the expression vector . 

Such a peptide linker also may be coded for nucleic 
acid sequences that are part of the polynucleotide primers 
5 used to prepare the various gene libraries. The nucleic 
acid sequence coding for the peptide linker can be made up 
of nucleic acids attached to one of the primers or the 
nucleic acid sequence coding for the peptide linker may be 
derived from nucleic acid sequences that are attached to 
10 several polynucleotide primers used to create the gene 
libraries. 

Typically the C terminus region of the V H and V L 
polypeptides will have a greater variety of the sequences 
than the N terminus and, based on the present strategy, 
15 can be further modified to permit a variation of the 
normally occurring V H and V L chains. A synthetic 
polynucleotide and be employed by vary one or more amino 
in an hypervariable region. 

1. Isolation Of A Gene Repe rtoire 

20 According to one aspect of the present invention, a 

gene repertoire useful in the methods the present inven- 
tion contains at least 10 3 , preferably at least 10 4 , more 
preferably at least 10 5 , and most preferably at least 10 7 
different consderved genes. Methods for evaulating the 

25 diversity of a repertoire of conserved genes are well 
known to one skilled in the art. 

Various well known methods can be employed to produce 
a useful gene repertoire. For example, to prepare a 
composition of nucleic acids containing a substantial 

in nortion of the Immunol oai cal aene repertoire r source of 



■ iiLiDod^ pioaucmy ceub, ^ . g . , l> xympnocyteb cexis,, 
preferably rearranged B cells such as those found in the 
t c, circulation or spleen of a vertebrate. fRearranqed B 
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tion, i.e., rearrangement, has occurred as evidenced by 
the presence in the cell of mRNA with the immunoglobulin 
gene V, D and J region transcripts adjacently located 
thereon . ) 

5 In some cases, it is desirable to bias the repertoire 

for a preselected activity, such as by using as a source 
of nucleic acid cells (source cells) from vertebrates in 
any one of various stages of age, health and immune 
response. For example, repeated immunization of a healthy 

10 animal prior to collecting rearranged B cells results in 
obtaining a repertoire enriched for genetic material 
producing a ligand binding polypeptide of high affinity. 
See, e.g. Mullinax et al., Proc. Nat. Acad. Sci. (USA^ 
87:8095-8099 (1990). Conversely, collecting rearranged B 

15 cells from a healthy animal whose immune system had not 
been recently challenged results in producing a repertoire 
that is not biased towards the production of high affinity 
V H and/ or V L polypeptides. 

It should be noted the greater the genetic hetero- 

2 0 geneity of the population of cells for which the nucleic 

acids are obtained, the greater the diversity of the 
immunological repertoire that will be made available for 
screening according to the method of the present inven- 
tion. Thus, cells from different individuals of different 
25 strains, races or species can be advantageously combined 
to increase the heterogeneity (diversity) of the 
repertoire. 

Thus, in one preferred embodiment, the source cells 
are obtained from a vertebrate, preferably a mammal, which 

3 0 has been immunized or partially immunized with an anti- 

genic ligand (antigen) against which activity is sought, 

. : : ■ : )\\ . '.ore - : lot.prr \ n- ^- • ~ :»Qv "imiii. . <-.«;'.. : '. :■ : 

3 5 desired, which stage corresponds to the amount of enrich- 
ment or biasing of the repertoire desired. Partially 
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and cells are collected therefrom shortly after a response 
is detected. Fully immunized animals display a peak 
titer, which is achieved with one or more repeated injec- 
tions of the antigen into the host mammal, normally at 2 
5 to 3 week intervals. Usually three to five days after the 
last challenge, the spleen is removed and the genetic 
repertoire of the spleenocytes , about 90% of which are 
rearranaed B cells, is isolated using standard procedures. 
See, Sarrent Pro i-n™i s in Molecular Biology, Ausubel et 
10 al., eds., John Wiley & Sons, NY. 

Nucleic acids coding for V H and V L polypeptides can be 
derived from cells producing IgA, IgD, IgE, IgG or IgM, 
most preferably from IgM and IgG, producing cells. 

Methods for preparing fragments of genomic DNA from 
15 which immunoglobulin variable region genes can be cloned 
as a diverse population are well known in the art. See 
for example Herrmann et al., Methods In Enzvmol . , 152:180- 
183, (1987); Frischauf, Methods In Enzvmol ♦ , 152:180-190 
(1987); Frischauf, Methods Tp Enzvmol . . 152:190-199 
20 (1987); and DiLella et al., MPf.hods In Enzvmol., 152:199- 
212 (1987) . (The teachings of the references cited herein 
are hereby incorporated by reference . ) 

The desired gene repertoire can be isolated from 
either genomic material containing the gene expressing the 
25 variable region or the messenger RNA (mRNA) which repre- 
sents a transcript of the variable region. The difficulty 
in using the genomic DNA from other than non-rearranged B 
lymphocytes is in juxtaposing the sequences coding for the 
variable region, where the sequences are separated by 
30 intervening regions. The DNA fragment (s) containing the 
„^ nr , > . ,„vi,h^ -Pninn= must- be isolated, the intervening 



most part., cms wiu oe dxiricuxL, ^ ..nu. ~nu 
3 5 technique employing rearranged B cells will be the method 
. ; ~ c borause the V. D and J immunoglobulin gene 



SUBSTITUTE SHEET 



WO 91/16427 



PCT/US91/02910 



42 

regions have translocated to become adjacent, so that the 
sequence is continuous for the variable regions. 

Where mRNA is utilized the cells will be lysed under 
RNase inhibiting conditions. In one embodiment, the first 
5 step is to isolate the total cellular mRNA by hybridiza- 
tion to an oligo-dT cellulose column. The presence of 
mRNAs coding for the heavy and/or light chain polypeptides 
can then be assayed by hybridization with DNA single 
strands of the appropriate genes. Conveniently, the 
10 sequences coding for the constant portion of the V H and V L 
can be used as polynucleotide probes, which sequences can 
be obtained from available sources. See for example, 
Early and Hood, Genetic Engineering , Setlow and 
Hollaender, eds., Vol. 3, Plenum Publishing Corporation, 
15 New York, (1981), pages 157-188; and Kabat et al., 
Seouences of Immunological Interest . National Institutes 
of Health, Bethesda, MD, (1987). Exemplary methods for 
producing V H and V L gene repertoires are described in PCT 
Application No. PCT/US 90/02836 (International Publication 
20 No. WO 90/14430) . 

In preferred embodiments, the preparation containing 
the total cellular mRNA is first enriched for the presence 
of V H and/or V L coding mRNA. Enrichment is typically 
accomplished by subjecting the total mRNA preparation or 
25 partially purified mRNA product thereof to a primer 
extension reaction employing a polynucleotide synthesis 
primer of the present invention. 

According to another aspect of the present invention, 
a gene repertoire may be generated from one or a few 
3 0 nucleotide sequences by replicating those sequences under 
mutagenesis conditions so that a plurality of different 
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2. Pre paratio n Of Polymic:! eotide Primers 
The term "polynucleotide" as used herein in reference 
to primers, probes and nucleic acid fragments or segments 
to be synthesized by primer extension is defined as a 
molecule comprised of two or more deoxyribonucleotides or 
ribonucleotides, preferably more than 3. Its exact size 
will depend on many factors, which in turn depends on the 
ultimate conditions of use. 

The term "primer" as used herein refers to a poly- 
nucleotide whether purified from a nucleic acid 
restriction digest or produced synthetically, which is 
capable of acting as a point of initiation of synthesis 
when placed under conditions in which synthesis of 
primer extension product which is complementary to 
15 nucleic acid strand is induced, i.e., in the presence of 
nucleotides and an agent for polymerization such as DNA 
polymerase, reverse transcriptase and the like, and at a 
suitable temperature and pH. The primer is preferably 
single stranded for maximum efficiency, but may alterna- 
20 tively be stranded. If double stranded, the primer is 
first treated to separate its strands before being used 
to prepare extension products. Preferably, the primer is 
a polydeoxyribonucleotide . The primer must be suffi- 
ciently long to prime the synthesis of extension products 
2 5 in the presence of the agents for polymerization. The 
exact lengths of the primers will depend on many factors, 
including temperature and the source of primer. For 
example, depending on the complexity of the target 
sequence, a polynucleotide primer typically contains 15 to 
25 or more nucleotides, although it can contain fewer 
nucleotides. Short primer molecules generally require 



30 



J D 



"substantially" complementary to the different strands of 
each specific sequence to be synthesized or amplified. 
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mentary to nonrandomly hybridize with its respective 
template strand. Therefore, the primer sequence need not 
reflect the exact sequence of the template. For example, 
a non-complementary nucleotide fragment can be attached to 
5 the 5' end of the primer, with the remainder of the primer 
sequence being substantially complementary to the strand. 
Such noncomplementary fragments typically code for an 
endonuclease restriction site. Alternatively, noncomple- 
mentary bases or longer sequences can be interspersed into 
10 the primer, provided the primer sequence has sufficient 
complementarily with the sequence of the strand to be syn- 
thesized to amplified to non-randomly hybridize therewith 
and thereby form an extension product under polynucleotide 
synthesizing conditions. 
15 Primers of the present invention may also contain a 

DNA-dependent RNA polymerase promoter sequence or its 
complement. See for example, Krieg et al., Nucleic Acids 
Research , 12:7057-70 (1984); Studier et al . , J. — J&Ql^. 

Biol. , 189:113-130 (1986); and Molecular Cloning: A 

20 Laboratory Manual. Second Edition , Maniatis et al., eds. , 
Cold Spring Harbor, NY (1989) . 

When a primer containing a DNA-dependent RNA poly- 
merase promoter is used, the primer is hybridized to the 
polynucleotide strand to be amplified and the second 

2 5 polynucleotide strand of the DNA-dependent RNA polymerase 

promoter is completed using an inducing agent such as 
coli DNA polymerase I, or the Klenow fragment of p. co3,i 
DNA polymerase. The starting polynucleotide is amplified 
by alternating between the production of an RNA poly- 

3 0 nucleotide and DNA plynucleotide. 

Primers may also contain a template sequence or 



* c- 1 i n s c r i e c. : " " :•' a r c .... ± Q ticcnno ^ gq v , 

35 1202 (1988), RNA-directed polymerases produce large 
numebrs of RNA strands from a small number of template RNA 
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initiation site. These polymerases typically give a one 
million-fold amplification of the template strand, as has 
been described by Kramer et al., J. flol, piol., 89:7819- 
736 (1974). 

5 The polynucleotide primers can be prepared using any 

suitable method, such as, for example, the phosphotriester 
on phosphodiester methods see Narang et al., Meth. 
Enzvmol . . 68:90, (1979); U.S. Patent No. 4,356,270; and 
Brown et al. , . Enzvmol . . 68:109, (1979). 

10 The choice of a primer's nucleotide sequence depends 

on factors such as the distance on the nucleic acid from 
the region coding for the desired receptor, its hybrid- 
ization site on the nucleic acid relative to any second 
primer to be used, the number of genes in the repertoire 

15 it is to hybridize to, and the like. 

(a) Primers for Produ cing V„ and V L DNA Homologs 

V and V, gene repertoires can be separately prepared 

H L 

prior to their use in the methods of the present inven- 
tion. Repertoire preparation is typically done by primer 

2 0 extension (or other in vitro amplif icaiton method) , 
preferably by primer extension in a PCR format. 

For example, to produce V H -coding DNA homologs by 
primer extension, the nucleotide sequence of a primer is 
selected to hybridize with a plurality of immunoglobulin 

25 heavy chain genes at a site substantially adjacent to the 
V H -coding region so that a nucleotide sequence coding for 
a functional (capable of finding) polypeptide is obtained. 
To hybridize to a plurality of different V H -coding nucleic 
acid strands, the primer must be a substantial complement 

^ of ?! nucleotide seauence conserved amona the different 



regions, preierauiy Liie u:nru iraraeworK region, .eaaei 
region, promoter region, J region and the like. 

If the V u -coding and V -codina DNA homoloqs are to be 
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two primers must be used for each coding strand of nucleic 
acid to be amplified. The first primer becomes part of 
the nonsense (minus or complimentary) strand and hybrid- 
izes to a nucleotide sequence conserved among V H (plus) 
5 strands within the repertoire. To produce V H coding DNA 
homologs, first primers are therefore chosen to hybridize 
to (i.e. be complementary to) conserved regions within the 
J region, CHI region, hinge region, C H 2 region, or C H 3 
region ot immunoglobulin genes and the like. To produce 
10 a V L coding DNA homolog, first primers are chosen to 
hybridize with (i.e. be complementary to) a conserved 
region with the J region or constant region of immuno- 
globulin light chain genes and the like. Second primers 
become part of the coding (plus) strand and hybridize to 
15 a nucleotide sequence conserved among minus strands. To 
produce the V H -coding DNA homologs, second primers are 
therefore chosen to hybridize with a conserved nucleotide 
sequence at the 5 1 end of the V H -coding immunoglobulin gene 
such as in that area coding for the leader or first frame- 
20 work region. It should be noted that in the amplification 
of both V H - and V L -coding DNA homologs, the conserved 5 1 
nucleotide sequence of the second primer can be comple- 
mentary to a sequence exogenous ly added using terminal 
deoxynucleotidyl transferase as described by Loh et al., 
25 Science 243 :217-220 (1989). One or both of the first and 
second primers can contain a nucleotide sequence defining 
an endonuclease recognition site. The site can be heter- 
ologous to the immunoglobulin gene being amplified and 
typically appears at or near the 5' end of the primer. 

30 (b) Inside and Outside Primers 

primer. Each of the inside primers has a priming region 
35 located at the 3 1 -terminus of the primer. The priming 



SUBSTITUTE SHEET 



WO 91/16427 



PCI7US91/02910 



47 



nucleotide bases. The 3 1 -terminal priming portion of each 
inside primer is capable of acting as a primer to catalyze 
nucleic acid synthesis, i.e., initiate a primer extension 
reaction off its 3» terminus. One or both of the inside 
primers is further characterized by the presence of a 5'- 
terminal (5' -most) non-priming portion, i.e., a region 
that does not participate in hybridization to repertoire 
template. 

In fusion PCR, each inside primer works in 
combination with an outside primer to amplify a target 
nucleic acid sequence. The choice of PCR primer pairs for 
use in fusion PCR as described herein is governed by the 
same considerations as previously discussed for choosing 
PCR primer pairs useful in producing gen repertoires. 
15 That is, the primers have a nucleotide sequence that is 
complementary to a sequence conserved in the repertoire. 
Useful V, and V M inside priming sequences are shown in 

L n 

Tables 1 and 2, respectively, below. 



10 



Table 1 

20 3' Priming Portions of Various I nside V L Primers 



Seq. 








Id. No. 








(D 1 


5 ' 


GTGATGACCCACTCTCC 3 




(2) 


5 1 


GTGATGACCCAGTCTCCA 


3 ' 


(3) 


5' 


GTTGTGACTCAGGAATCT 


3 ' 


(4) 


5' 


GTGTTGACGCAGCCGCCC 


3 1 


(5) 


5' 


GTGCTCACCCAGTCTCCA 


3 ' 


(6) 


5 f 


CAGATGACCCAGTCTCCA 


3 ' 


(7) 


5 ' 


GTGATGACCCAGACTCCA 


3 ' 


(8) 


5 1 


GTCATGACCCAGTCTCCA 


3 1 



the amplification of kappa light chain variable 
35 reqions. 
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Table 2 

3' Primina Portions of Var ious Inside V„ Primers 

Seq. 
Id, No. 

(ID 1 



20 



(12) ' 

(13) 3 

(14) 4 

(15) 5 
10 (16) 6 

(17) 7 
(18) 

(19) 
(20) 

15 (21) 
1 

2 
3 



25 



8 



10 



11 



4 
5 

6 

7 
8 



30 



10 



5« ACAAGATTTGGGCTC 3 ! 
5 1 TGGGGTTTTGAGCTC 3 1 
5' GAGACAGTGACCGGGTTCCTTGGCCCCA 3' 
5' TGGAATGGGCACATGCAG 3 1 
5 i TTATCATTTACCCGGAGA 3 1 
5' AACGGTAACAGTGGTGCCTTGGCCCCA 3' 
5' ACAATCCCTGGGCACAAT 3' 
5 • CACCTTGGTG CTGCTGG C 3 r 
5' ACAACCACAATCCCTGGGCACAATTTT 3 1 

5 1 ACAATCCCTGGGCACAAT 3' 
5' GAGTTCACTAGTTGGGCACGGTGGGCA 3' 
Unique 3' primer for human IgGl, 2, 3, and 4 F.2d. 
Unique 3' primer for human V H amplification. 
3» primer for amplifying human heavy chain variable 
regions . 

3' primer for amplifying the Fd region of mouse IgM. 
3 ■ primer located in the CH3 region of human IgGl to 
amplify the entire heavy chain. 
Unique 3 1 primer for amplification of mouse F v 
Unique 3' primer for amplification of mouse IgGl Fd. 
Unique 3 1 primer for amplification of VH including 
part of the mouse gamma 1 first constant region. 
Unique 3 1 primer for amplification of VH including 
part of mouse gamma 1 first constant region and hinge 
region. 

3' primer for amplifying mouse Fd including part of 
the mouse IgG first constant region and part of the 



i m a i. 



3 5 the hinge region including the two cysteines which 

create the disulfide bridge for producing Fab 1 2 (the 
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10 



A preferred set of inside primers used herein has 
primers with complementary 5 '-terminal non-priming 
regions, the complementary strands of which are capable of 
hybridizing to each other to form a duplex with 3' over- 
hangs. The duplex encodes all or part of a double 
stranded cistronic bridge. That is, if the 3' overhangs 
of the duplex are filled in with complementary bases so as 
to define a double stranded DNA extending from the 3'- 
terminus of one of the inside primers to the 3 '-terminus 
of the other of the inside primers, that double stranded 
DNA segment forms a sequence of nucleotides that opera- 
tively links the upstream and downstream cistrons for 
polycistronic expression. Thus, while each of the inside 
primers in a set contains only a portion of the sequence 
15 information necessary to form the double stranded 
cistronic bridge, the two inside primers in combination 
encode both the plus and minus strands of all or part of 
the bridge. 

For example, one inside upstream primer can have a 
20 sequence that forms a portion of the plus strand of the 
bridge, and the other inside primer encodes the sequence, 
through complementarity, of the downstream portion of the 
plus strand. 

In a preferred embodiment, the plus strand of the 
25 cistronic bridge contains, in the translational reading 
frame and from an upstream position to a downstream 
position, sequences coding for (i) at least one stop 
codon, preferably two, in the same reading frame as the 
upstream cistron, (ii) a ribosome binding site, and (iii) 
a polypeptide leader, the translation initiation codon of 
which is in the same reading frame as the downstream 



30 



A. 

3 5 cistron from the polycistronic mRNA. 

The predicted amino acid residue sequences of two 
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in Table 3. Lei, et al., supra . Amino Acid residue 
sequences for other leaders from E. coli useful in this 
invention are also listed in Table 3. Oliver, In 
Neidhart, F. C. (ed.), Escherich ia coli and Salmonella 
5 Typhimurium , American Society for Microbiology, 
Washington, D. C. , 1:56-69 (1987). These regions for the 
heavy chain are contained in the modified ImmunoZAP H 
expression vector. Mullinax, et al., proc. Na tl. Acad. 
Sci. , USA , 87:8095-8099 (1990). 

io lafeig 3 

Leader Sequences 
Seq. 

Id. No. Type Amino Acid Residue Sequence 

( 22 ) pelB 1 MetLysTyrLeuLeuProThrAlaAlaAlaGlyLeuLeu 
15 LeuLe\iAlaAlaGlnProAlaGlnProAlaMetAla 

( 23 ) pelB 2 MetLysSerLeuIleThrProIleAlaAlaGlyLeuLeu 

LeuAlaPheSerGlnTyrSerLeuAla 

( 24 ) MalE 3 MetLysIleLysThrGlyAlaArglleLeuAlaLeuSer 

AlaLeuThrThrMetMet PheS er Al a S er A 1 aLeuAl a 

20 Lyslle 

(25) OmpF 3 MetMetLysArgAsnlleLeuAlaVallleValProAla 

LeuLeuValAlaGlyThrAlaAsnAlaAlaGlu 

(26) PhoA 3 MetLysGlnSerThrlleAlaLeuAlaLeuLeuProLeu 

LeuPheThrProValThrLysAlaArgThr 
25 (27) Bla 3 MetSerlleGlnHisPheArgValAlaLeuIleProPhe 

PheAlaAlaPheCysLeuProValPheAlaHisPro 
(28) LamB 3 MetMetlleThrLeuArgLysLeuProLeuAlaValAla 

ValAlaAlaGlyValMetSerAlaGlnAlaMetAlaVal 
Asp 

3 0 ( 29 ) Lpp 3 MetLysAlaThrLysLeuValLeuGlyAlaVallleLeu 



^drotovoi ,i i en* 
pelB from Ervinia carotovora EC 16 gene 
3 5 3 leader sequences from E, coli 
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To achieve high levels of gene expression in E . coli , 
it is necessary to use not only strong promoters to gener- 
ate large quantities of mRNA, but also ribosome binding 
sites to ensure that the mRNA is efficiently translated. 
In E y coli , the ribosome binding site includes an initi- 
ation codon (AUG) and a sequence 3- nucleotides long 
located 3 11 nucleotides upstream from the initiation 
codon [Shine et al., Nature , 254:34 (1975)]. The 
sequence, AGGAGGU , which is called the shine-Dalgarno (SD) 
sequence, is complementary to the 3' end of E. coli 16S 
mRNA. Binding of the ribosome to mRNA and the sequence at 
the 3' end of the mRNA can be affected by several factors: 
(i) The degree of complementarity between the SD 
sequence and 3' end of the 16S tRNA. 
15 (ii) The spacing and possibly the DNA sequence lying 

between the SD sequence and the AUG [Roberts et al., Proc, 
Natl. Acad. Sci. USA , 76:760 (1979A) ; Roberts et al., 
Proc. Natl, Acad. Sci. USA , 76:5596 (1979B) ; Guarente et 
al., science . 209:1428 (1980); and Guarente et al., Cell, 
20 20:543 (1980).] Optimization is achieved by measuring the 
level of expression of genes in plasmids in which this 
spacing is systematically altered. Comparison of differ- 
ent mRNAs shows that there are statistically preferred 
sequences from positions -2 0 to +13 (where the A of the 
25 AUG is position 0) [Gold et al., ftnnu. Rev. Microbipl., 
35:365 (1981)]. Leader sequences have been shown to 
influence translation dramatically (Roberts et al. 1979 a, 
b supra ) . 

(iii) The nucleotide sequence following the AUG, 
which affects ribosome binding [Taniguchi et al., J. Mol. 
Biol - , 118:533 (1978) 1 . 
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Table 4 

Ribosome Binding Sites* 

Seq. 

Id. No, 

5 1. (30) 5' AAUCUU GGAGG CUUUUU UAUG GUUCGUUCU 

2. (31) 5' UAA CUAAGGAU GAAAUGCAUGUCUAAGACA 

3. (32) 5 1 UCCU AGGAGGUU UGACCUAUGCGAGCUUUU 

4. (33) 5« AUGUAC UAAGG AGGUU GUftff GG AACAACG C 

* Sequences of initiation regions for protein 
10 synthesis in four phage mRNA molecules are underlined. 
AUG = initiation codon (double underlined) 

1. = Phage 0X174 gene-A protein 

2. = Phage Q/3 replicase 

3. = Phage R17 gene-A protein 

15 4. = Phage lambda gene-cro protein 

It is preferred that the complementary (overlapping) 
region of the inside primers and the priming portion of 
the inside primers have about the same denaturation 
temperature, Td. The Td of a sequence can be estimated by 

20 the following formula: Td = 4(C+G) + 2(A+T), where C, G, 
A and T represent the respective number of cytosine f 
guanine, adenine and thymine bases in the sequence. A Td 
for the above-identified hybridizing region of about 45- 
55 °C, preferably about 50' C, is preferred. Typically, 

2 5 overlapping regions in the range of about 15 to 20 

nucleotides works well in conjunction with the priming 
regions in the range of 15-30 nucleotides. 

The set of outside primers forms the termini of the 
dicistronic DNA molecule. The set of outside primers 

3 0 comprises an upstream outside primer and a downstream 

outside primer. The outside primers each comprise a 3'- 



3 5 in a 5 ' -terminal non-priming portion of the outside 
primer. The restriction site defined by the upstream 
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a restriction enzyme that does not recognize the restric- 
tion site defined by the downstream outside primer, the 
objective being to be ^ble to produce a dicistronic DNA 
having cohesive termini that are non-compelementary to 
5 each other and thus allow directional insertion into a 
vector. 

Useful outside primer sequences are shown in Tables 
5 and 6 below. 

Table 5 
10 Outside V ([ Primers 
Seg. 
Id. No. 

( 3 4 ) 1 5 ' AGGTCCAGCTGCTCGAGTCTGG 3 1 

(35) 5 1 AGGTCCAGCTGCTCGAGTCAGG3 1 

15(36) 5 ' AGGTCCAG CTTCTCGAGTCTGG 3 1 

(37) 5 1 AGGTCCAGCTTCTCGAGTCAGG3 1 

(38) 5 ■ AGGTCCAACTGCTCG AGTCTGG 3 ■ 

(39) 5 1 AGGTCCAACTGCTCG AGTCAGG 3 ' 

(40) 5 1 AGGTCCAACTTCTCG AGTCTGG 3 1 
20(41) 5 1 AGGTCCAACTTCTCGAGTCAGG3 1 

(42) 2 5 1 AGGTGCAGCTGCTCGAGTCTGG3 1 

(43) 5 1 AGGTGCAGCTGCTCGAGTCGGG3 1 

(44) 5 1 AGGTGCAACTGCTCGAGTCTGG3 1 

(45) 5 1 AGGTGCAACTGCTCGAGTCGGG3 1 

25 1 Nucleotide sequences 21-28 are unique 5 1 primers for 
the amplification of mouse V H genes. 
2 Nucleotide sequences 29-32 are unique 5' primers for 
amplification of nucleic acids coding for human 
variable regions. 



Id. No. 

MfH 1 S • ACGTCTAGATTCCACCTTGGTCCC 3' 
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(48) 3 


5' 


GCATTCTAGACTATTAACATTCTGTAGGGGC 3 1 


(49) 4 


5' 


GCAGCATTCTAGAGTTTCAGCTCCAGCTTGCC 3 1 


(50) 5 


5' 


CCGCCGTCTAGAACACTCATTCCTGTTGAAGCT 3 1 


(51) 6 


5' 


CCGCCGTCTAGAACATTCTGCAGGAGACAGACT 3 ' 


5 (52) 7 


5 1 


GCGCCGTCTAGAATTAACACTCATTCCTGTTGAA 3 1 


(53) 8 


5' 


GCCGCTCTAGAACACTCATTCCTGTTGAA 3' 


(54) 9 


5 1 


TCCTTCTAGATTACTAACACTCTCCCCTGTTGAA 3 1 


(55) 10 


5' 


GCATTCTAGACTATTATGAACATTCTGTAGGGGC 3 1 



10 



15 



20 



25 



2 

3 



a 

9 
10 



■ 1 ^ £.*4 U««<m%^v% V-svMna r»hain Vf^yi Ahil A 

j ■ prxmer iui diup-i.xi._y muuun jvu^« — 

regions. 

3 1 primer in human kappa light chain constant region. 
3 1 primer in human lambda light chain constant 
region. 

Unique 3' primer for amplification of kappa light 
chain variable regions. 

Unique 3' primer for mouse kappa light chain 

amplification including the constant region. 

Unique 3 1 primer for mouse lambda light chain 

amplification including the constant region. 

Unique 3' primer for amplification of kappa light 

chain. 

Unique 3 1 primer for amplification of mouse kappa 
light chain. 

Unique 3' primer for kappa V L amplification. 

Unique 3' primer for human, mouse and rabbit lambda 

V L amplification. 



3 . Preparing a Gene Library 

The strategy used for cloning, i.e., substantially 
reproducing the V H and/or V L genes contained within the 
3 0 isolated repertoire will depend, as is well known in the 



. , . .; t- c v 



plurality of repertoires or populations and whether or not 
35 they are to be amplified and/or mutagnized. 
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a. Preparing V„ and V L libraries 

In one strategy, the object is to clone the V H - and/or 
V L -coding genes from a repertoire comprised of polynucleo- 
tide coding strands, such as mRNA and/or the sense strand 
5 of genomic DNA. If the repertoire is in the form of 
double stranded genomic DNA, it is usually first dena- 
tured, typically by melting, into single strands. The 
repertoire is subjected to a first primer extension 
reaction by treating (contacting) the repertoire with a 

10 first polynucleotide synthesis primer having a preselected 
nucleotide sequence. The first primer is capable of ini- 
tiating the first primer extension reaction by hybridizing 
to a nucleotide sequence, preferably at least about 10 
nucleotides in length and more preferably at least about 

15 20 nucleotides in length, conserved within the repertoire. 
The first primer is sometimes referred to herein as the 
"sense primer" because it hybridizes to the coding or 
sense strand of a nucleic acid. In addition, the second 
primer is sometimes referred to herein as the "anti-sense 

20 primer" because it hybridizes to a non-coding or anti- 
sense strand to a nucleic acid, i.e., a strand 
complementary to a coding strand. 

The PCR reaction is performed by mixing the PCR pair, 
preferably a predetermined amount thereof, with the 

25 nucleic acids of the repertoire, preferably a predeter- 
mined amount thereof, in a PCR buffer to form a first PCR 
admixture. The admixture is maintained under polynucleo- 
tide synthesizing conditions for a time period, which is 
typically predetermined, sufficient for the formation of 

3 0 a PCR reaction product, thereby producing a gene library 
containing a plurality of different V H - and/or V, -coding 



35 one species of first primer can be paired with a number of 
second primers to form several different primer pairs. 

i i : , . . , . r ; -,. ■ < -.I-, • " i * : ■! * l s ' 

' t •* t f •' .i ■ . 4, . ■* \-» ■■ ~ ■ ■■ ' 
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primers can be used. In any case, the amplification 
products of amplifications using the same or different 
combinations of first and second primers can be combined 
to increase the diversity of the gene library. 
5 In another strategy, the object is to clone the V H - 

and/or V L -coding gene from a repertoire by providing a 
polynucleotide complement of the repertoire, such as the 
anti-sense strand of genomic dsDNA or the polynucleotide 
produced by subjecting mRNA to a reverse transcriptase 

10 reaction. Methods for producing such complements are well 
known in the art. The complement is subjected to a primer 
extension reaction similar to the above-described second 
primer extension reaction, i.e., a primer extension 
reaction using a polynucleotide synthesis primer capable 

15 to hybridizing to a nucleotide sequence conserved among a 
plurality of different V H -coding gene complements. 

The primer extension reaction is performed using any 
suitable method. Generally it occurs in a buffered aque- 
ous solution, preferably at a pH of 7-9, most preferably 

2 0 about 8. Preferably, a molar excess (for genomic nucleic 

acid, usually about 10 6 :1 primer : template) of the primer is 
admixed to the buffer containing the template strand. A 
large molar excess is preferred to improve the efficiency 
of the process. 

25 The deoxyribonucleotide triphosphates dATP, dCTP, 

dGTP, and dTTP are also admixed to the primer extension 
(polynucleotide synthesis) reaction admixture in adequate 
amounts and the resulting solution is heated to about 
90°C-100'C for about 1 to 10 minutes, preferably from 1 

3 0 to 4 minutes. After this heating period the solution is 

allowed to cool to room temperature, which is preferable 



J .J 



under conditions known in the art. The synthesis reaction 
may occur at from room temperature up to a temperature 
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10 



15 



20 



25 



ciently. Thus, for example, if DNA polymerase is used as 
inducing agent, the temperature is generally no greater 

than about 40 

The inducing agent may be any compound or system 
which will function to accomplish the synthesis of prxmer 
extension products, including enzymes. Suitable enzymes 
for this purpose include, for example, i^coli, DNA poly- 
merase I, Klenow fragment of E^coii DNA polymerase I, T4 

^, „.,4i.m 0 nwj nni vmprases , reverse 

DNA polymerase, Ou lci " c ~ 

transcriptase, and other enzymes, including heat-stable 
enzymes, which will facilitate combination of the nucleo- 
tides in the proper manner to form the primer extension 
products which are complementary to each nucleic acid 
strand. Generally, the synthesis will be initiated at the 
3- end of each primer and proceed in the 5- direction 
along the template strand, until synthesis terminates, 
producing molecules of different lengths. There may be 
inducing agents, however, which initiate synthesis at the 
5 . end and proceed in the above direction, using the same 
process as described above. 

The inducing agent also may be a compound or system 
which will function to accomplish the synthesis of RNA 
primer extension products, including enzymes. in 
preferred embodiments, the inducing agent may be a DNA- 
dependent RNA polymerase such as T7 RNA polymerase, T3 RNA 
polymerase or SP6 RNA polymerase. These polymerases 
produce a complementary RNA polynucleotide. The high turn 
overrate of the RNA polymerase amplifies the starting 
polynucleotide as has been described by Chamberlin et al. , 

~* p nnver PP 87-108< Academic Press, New 
, The Enzvmes , ed. P. Boyer, rr. »' 

York (1982). Another advantage of T7 RNA polymerase is 



35 



transcribing the partially-mismatched template directly a, 
has been previously described by Joyce et al., Nucleic 

i _-->: '10?.^ . Amplification systems 
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based on transcription have been described by Gingeras et 
al», in PCR Protocols, A Guide to Methods and 
Applications . PP. 245-252, Academic Press, Inc. , San 
Diego, CA (1990) . 
5 If the inducing agent is a DNA-dependent RNA 

polymerase and therefore incorporates ribonucleotide 
triphosphates, sufficient amounts of ATP, CTP, GTP and UTP 
are admixed to the primer extension reaction admixture and 
the resulting solution is treated as described above-. 

10 The newly synthesized strand and its complementary 

nucleic acid strand form a double-stranded molecule which 
can be used in the succeeding steps of the process. 

The first and/or second primer extension reaction 
discussed above can advantageously be used to incorporate 

15 into the multimeric polypeptide a preselected epitope 
useful in immunologically detecting and/or isolating a 
multimeric polypeptide. This is accomplished by utilizing 
a first and/or second polynucleotide synthesis primer or 
expression vector to incorporate a predetermined amino 

20 acid residue sequence into the amino acid residue sequence 
of the receptor. 

After producing V H - and/or V L -coding DNA homologs for 
a plurality of different V H - and/or V L -coding genes within 
the repertoire, the homologs are typically amplified. 

25 While the V H and/or V L -coding DNA homologs can be amplified 
by classic techniques such as incorporation into an auto- 
nomously replicating vector, it is preferred to first 
amplify the DNA homologs by subjecting them to a polymer- 
ase chain reaction (PCR) prior to inserting them into a 

3 0 vector. In fact, in preferred strategies, the first 
and/or second primer extension reactions used to produce 
the gene library are the first and qprnnr) n**^* 31- nv*- n~<- - 

. i-.iw. — iiieuubx. pel lotTuiny ...ii one admixture, trie above 
described first and second primer extension reactions, 
each cycle comprising polynucleotide synthesis ] nvpn h ,r 
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10 



denaturation of the double stranded polynucleotides 
formed. Methods and systems for amplifying a DNA homolog 
are described in U.S. Patents No. 4,683,195 and No. 
4,683,202, both to Mullis et al. Preferably, PCR is 
carried out by thermocycling i.e., repeatedly increasing 
and decreasing the temperature of a PCR reaction admixture 
within a temperature range whose lower limit is about 10 °C 
to about 50 'C and whose upper limit is about 90 "C to about 
100 °C. The increasing and decreasing can be continuous, 
but is preferably phasic with time periods of relative 
temperature stability at each of temperatures favoring 
polynucelotide synthesis, denaturation and hybridization. 
In preferred embodiments only one pair of first and 
second primers is used per amplification reaction. The 
15 amplification reaction products obtained from a plurality 
of different amplifications, each using a plurality of 
different primer pairs, are then combined. 

However, the present invention also contemplated DNA 
homolog production via co-amplification (using two pairs 
20 of primers) , and multiplex amplification (using up to 
about 8, 9 or 10 primer pairs). 

The V- and V. -coding DNA homologs produced by PCR 

H » 

amplification are typically in double-stranded form and 
have contiguous or adjacent to each of their termini a 
25 nucleotide sequence defining an endonuclease restriction 
site. Digestion of the V H - and V L -coding DNA homologs 
having restriction sites at or near their termini with one 
or more appropriate endonucleases results in the produc- 
tion of homologs having cohesive termini of predetermined 

30 specificity. 

In preferred embodiments, the PCR process is used not 



heterogeneity. First, it should be noted that the PCR 
processes itself is inherently mutagenic due to a variety 
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the mutation inducting variations described in the above 
referenced U.S. Patent No. 4 , 683, 195 , other mutation 
inducing PCR variations can be employed. For example, the 
PCR reaction admixture, i.e., the combined first and 
5 second primer extension reaction admixtures, can be formed 
with different amounts of one or more of the nucleotides 
to be incorporated into the extension product. Under such 
conditions, the PCR reaction proceeds to produce nucleo- 
tide substitutions within the extension product as a 

10 result of the scarcity of a particular base. Similarly, 
approximately equal molar amounts of the nucleotides can 
be incorporated into the initial PCR reaction admixture in 
an amount to efficiently perform X number of cycles, and 
then cycling the admixture through a number of cycles in 

15 excess of X, such as, for instance, 2X. Alternatively, 
mutations can be induced during the PCR reaction by incor- 
porating into the reaction admixture nucleotide deriva- 
tives such as inosine, not normally found in the nucleic 
acids of the repertoire being amplified. During subse- 

20 quent in vivo amplification, the nucleotide derivative 
will be replaced with a substitute nucleotide thereby 
inducting a point mutation. 

b. Preparing a Dicistronic DNA molecule Library 
In one embodiment, a library of dicistronic DNA 

2 5 molecules containing upstream and downstream cistrons 

operatively linked by a cistronic bridge can be produced 
by the following steps: 

(a) Subjecting a repertoire of first polypeptide 
genes (e.g., V H -coding genes), to PCR amplification using 
30 first outside and first inside primers, i.e., a first PCR 

. i U l 1 17. U .... \ i \ '. . : . i i i ■ 

second outside and second inside pinners, i.e., a second 

3 5 PCR primer pair, to form a second primary PCR product. 



SUBSTITUTE SHEET 



WO 91/16427 



PCT/US9 1/02910 



61 



10 



(c) Hybridizing the first and second primary PCR 
products to form internally (self) primed duplexes, i.e., 
duplexes having 3 • -hybridized and 5 ' -overhanging termini. 

(d) Subjecting the internally-primed duplexes to 
primer extension reaction conditions to form double 
stranded duplexes having substantially blunt, preferably 
blunt, termini and a dicistronic strand containing the 
upstream and downstream cistrons linked by a cistronic 
bridge encoded by the inside primers. By "substantially 
blunt" is meant having no more than about one or two 
overhanging nucleotides. (Substantialloy blunt double 
stranded DNA is sometimes produced by primer overextension 
by Taq polymerase, usually by the addition of one or two 
terminal adenine residues . ) 

15 The V H - and V L -coding gene repertoires are comprised 

of polynucleotide coding strands, such as mRNA and/ or the 
sense strand of genomic DNA. If the repertoire is in the 
form of double stranded genomic DNA, it is usually first 
denatured, typically by melting, into single strands. A 

20 repertoire is subjected to a PCR reaction as described in 
Section 3a hereinabove. 

In preferred embodiments the ratio of gene molecules 
and their respective primers is as follows: about 1 x 10 
V H gene molecules to about 1 x 10 8 outside V„ gene molecules 

25 to about 1 x 10 8 outside V H primer molecules, about 1 x 10 
V qene molecules, to about 1 x 10 7 inside V H gene primer 

H ~ g 

molecules, about 1 x 10 3 V L gene molecules to about 1 x 10 
outside V L gene primer molecules, about 1 x 10 4 V L gene 
molecules to about 1 x 10 7 V L gene primer molecules. In 
30 more preferred embodiments, 10 4 outside V H gene primer 
molecules and 10 3 inside V H gene primer molecules are used 



is typically a 10 tola nioiar excess oi outside primer to 
inside primer. 
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In the fusion PCR reaction, the gene repertoires are 
admixed with outside and inside primers, the outside 
primers being present in excess relative to the inside 
primers. The initial PCR thenaocycles produce intermedin 
5 ate products having complementary termini from each of the 
first and second gene repertoires. That is, the end of 
one strand from one primary PCR product is capable of 
hybridizing with the complementary end from the other 
primary PCR product. The strands having the overlap at 

10 their 3 1 ends can act as primers for one another, i.e., 
from an internally primed duplex, and be extended by the 
polymerase to form the full length final product. The 
final product is then amplified by the set of outside 
primers, which act as a third PCR pair when the inside 

15 primers have been exhausted, to form a secondary PCR 
product. Typically the molar ratio of outside primers to 
inside primers is such that the inside primers are 
effectively exhausted within about 2 to about 12, 
preferably about 5, 6 or 7 thermocycles . 

20 The PCR buffer also contains the deoxyr ribonucleotide 

triphosphates dATP, dCTP, dGTP, and a polymerase, typic- 
ally thermostable, all in adequate amounts for primer 
extension (polynucleotide synthesis) reaction. The 
resulting solution (PCR admixture) is heated to about 90 *C 

25 - 100 °C for about 1 to 10 minutes, preferably from 1 to 4 
minutes. After this heating period the solution is 
allowed to cool to 54 "C, which is preferably for primer 
hybridization. The synthesis reaction may occur at from 
room temperature up to a temperature above which the poly- 

30 merase (inducing agent) no longer functions efficiently. 
Thus, for example, if DNA polymerase is used as inducing 

v wi/voij gelatin, 200 /iM dATP; 200 dTTP; 200 dCTP; 
200 /iM dGTP; and 2.5 units Thermus accruaticus DNA poly- 
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merase I (U.S. Patent No. 4,889,818) per 100 microliters 
of buffer. 

After producing op^ratively linked V H - and V L -coding 
DNA homologs for a plurality of different V H - and V L - 
5 coding genes within the repertoires, the dicistronic DNA 
molecules are typically further amplified. While the 
dicistronic DNA molecules can be amplified by classic 
techniques such as incorporation into an autonomously 
replicating vector, it is preferred to first amplify the 

10 molecules by subjecting them to a polymerase chain reac- 
tion (PCR) prior to inserting them into a vector. In 
fact, in preferred strategies, the first and second PCR 
reactions are performed in the same admixture that is 
subject to a multiplicity of PCR thermocycles where the 

15 outside primers are in molar excess. Preferably the 
number of PCR thermocycles is at least n+5, wherein n is 
the number of PCR thermocycles necessary to decrease by a 
factor of 10, and preferably exhaust, the number of inside 
primers by consumption in the formation of inside primer- 

20 primed products. 

A diverse library of dicistronic DNA molecules having 
upstream and downstream cistrons can also be produced by 
combining, in a PCR buffer, double stranded V H and V L 
repertoires, V H and V L outside primers, and an inside 

25 primer having a 3 ! -teminal priming portion, a cistronic 
bridge coding portion, and a 5 '-terminal inside primer- 
template (primer-coding) portion. The 3 ' -terminal priming 
portion has a nucleotide base sequence complementary to a 
portion of the primer extension product of one of the 

3 0 outside primers. The 5' -terminal primer-template portion 
has a nucleotide base sequence homologous (identical) to 



3 5 toires. The cistronic bridge coding portion codes for, 
either directly or through complementarily , at least one 
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cistron and sequences for the expression of the downstream 
cistron. 

The dicistronic DNA molecules containing operatively 
linked V u - and V. -coding DNA homologs produced by PCR 
5 amplification are typically in double-stranded in form and 
may have contiguous or adjacent to each of their termini 
a nucleotide sequence defining an endonuclease restriction 
site. Digestion of the dicistronic DNA molecules having 
restriction sites at or near their temini with one or more 
10 appropriate endonucleases results in the production of DNA 
molecules having cohesive termini of predetermined 
specificity. 

When individual PCR admixtures contain diverse gene 
repertoires the present invention produces many non- 
15 naturally occurring antibodies, i.e., combinations of V H 
and V L in a heterodimer. To take advantage of the 
mammalian immune system's capacity to select V H and V L 
combinations, the present invention also contemplates 
using fusion PCR to operatively link, and thereby recover, 
20 naturally occurring V H and V L combinations - 

In certain preferred embodiments, a fusion PCR method 
is performed on repertoires comprising a plurality of 
substantially isolated cells containing genes coding for 
a heterodimeric receptor. For example, a plurality of PCR 
25 admixtures is formed, each of which contains (i) a sample 
of substantially isolated B lymphocytes from a mammal pro- 
ducing antibody molecules against a preselected antigen, 
(ii) a PCR buffer, and (iii) either the previously 
described V H and V L PCR primer pairs or the set of outside 
3 0 V H and V L PCR primers in combination with the linking 
primer(s) , also as previously described. The plurality of 

35 containing less than about 100 target cells, such as B 
lymphocytes, T cells, and the like. In preferred embodi- 
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one cell. The cells are typically obtained from an indi- 
vidual mammal whose serum contains antibody molecules 
against the preselcted antigen. The collected cells are 
typically seeded, usually at densities in the range of 0.5 
5 to 100 cells per unit volume, into a plurality of indi- 
vidual PCR vessels, such as microtiter plate wells and the 
like. Usually, the plurality of PCR admixtures is in the 
range of 800 to 1200, and preferably is about 1000, 
separate admixtures. 

10 Typically, fewer cells are needed in each PCR 

admixture where the cells are obtained from individuals 
expressing a high serum antibody titer against the pre- 
selected antigen. For example, where B lymphocytes are 
obtained from an individual having a frequency of circu- 

15 lating B cells producing the antibody molecules of 
preselected specificity of 1/3000, each of about 800 to 
1200 individual PCR admixtures need only contain about one 
B lymphocyte to result in isolation of the desired anti- 
body. Where the circulating B cell frequency is in the 

20 range of 1/500,000, a density of about 100 cells per PCR 
admixture in each of about 800 to 1200 individual PCR 
admixtures will be needed before the process will result 
in isolation of the desired antibody. 

In preferred embodiments, the PCR process is used not 

25 only to produce a library of dicistronic DNA molecules, 
but also to induce mutations within the library or to 
create diversity from a single parental clone and thereby 
provide a library having a greater heterogeneity as noted 
in Section 3a hereinabove. 

3 0 4 . Expression 



operatively linked to a vector for amplification and/or 
35 expression. 
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The choice of vector to which a V H - and/ or V L -coding 
DNA homolog is operatively linked depends directly, as is 
well known in the art, on the functional properties 
desired, e.g., replication or protein expression, and the 
5 host cell to be transformed, these being limitations 
inherent in the art of constructing recombinant DNA 
molecules. In preferred embodiments, the vector utilized 
includes a procaryotic replicon i.e., a DNA sequence 
having the ability to direct autonomous replication and 
10 maintenance of the recombinant DNA molecule extra chromo- 
somally in a procaryotic host cell, such as a bacterial 
host cell, transformed therewith. Such replicons are well 
known in the art. In addition, those embodiments that 
include a procaryotic replicon also include a gene whose 
15 expression confers a selective advantage, such as drug 
resistance, to a bacterial host transformed therewith. 
Typical bacterial drug resistance genes are those that 
confer resistance to ampicillin or tetracycline. 

Those vectors that include a procaryotic replicon can 
20 also include a procaryotic promoter capable of directing 
the expression (transcription and translation) of the V H - 
and/or V L -coding homologs in a bacterial host cell, such 
as E. coli transformed therewith. A promoter is an 
expression control element formed by a DNA sequence that 
25 permits binding of RNA polymerase and transcription to 
occur. Promoter sequences compatible with bacterial hosts 
are typically provided in plasmid vectors containing 
convenience restriction sites for insertion of a DNA 
segment of the present invention. Typical of such vector 
30 plasmids are pUC8, pUC9 , pBR322, and pBR329 available from 
BioRad Laboratories, (Richmond, CA) and pPL and pKK2 23 

•■ -v.. .. ^Lerrov. i. .. vj j\ ;ii 

jtner about Jb bp (-35 region) upstream from the point at 
which transcription starts. These two regions typically 

determine promoter strength,. In addition., the r.umbei 
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nucleotides atht separate the conserved sequences is 
important for efficient promoter function. For example, 
16 to 19 nucleotides typically separate the -10 and -3 5 
regions, and changes in that psacing can change the 
5 efficiency of a promoter. 

Promoters useful in this invention include Ptac 
<p 1.1A, <t> LIB and <p 10, which are recognized by T7 
polymerase. See U.S. Patent No. 4,946,786. Useful 
reguiatabie promoters include the E. coii lac promoter 

10 described in U.S. Patent No. 4,936,786 and the promoters 
for the temperature sensitive genes in U.S. Patent No. 
4,806,471. See also U.S. Patent No. 4,711,845. 

Expression vectors compatible with eukaryotic cells, 
preferably those compatible with vertebrate cells, can 

15 also be used. Eukaryotic cell expression vectors are well 
known in the art and are available from several commercial 
sources. Typically, such vectors are provided containing 
convenient restriction sites for insertion of the desired 
DNA homologue. Typical of such vectors are pSV L and pKSV- 

20 10 (Pharmacia), pBPV-l/PML2d (International 
Biotechnologies, Inc.), and pTDTl (ATCC, No. 31255). 

In preferred embodiments, the eukaryotic cell 
expression vectors used include a selection marker that is 
effective in an eukaryotic cell, preferably a drug resist- 

2 5 ance selection marker. A preferred drug resistance marker 
is the gene whose expression results in neomycin resist- 
ance, i.e., the neomycin phosphotransferase (neo) gene. 
Southern et al., J. Mol. ApdI- Genet. . 1:327-341 (1982). 
The use of retroviral expression vectors to express 

30 the genes of the V H and/ or V L -coding DNA homologs is also 
contemplated. As used herein, the term "retroviral 
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5 In preferred embodiments, the expression vector is 

typically a retroviral expression vector that is prefer- 
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construction and use of retroviral vectors has been 
described by Sorge et al., flo? s Cel. Biol., 41730-1737 
1984) . 

A variety of methods have been developed to opera- 
5 tively link DNA to vectors via complementary cohesive 
termini. For instance, complementary cohesive termini can 
be engineered into the V H - and/or V L -coding DNA homologs 
during the primer extension reaction by use of an appro- 
priately designed polynucleotide syndesis primer, 
10 previously discussed. The vector, and DNA homolog if 
necessary, is cleaved with a restriction endonuclease to 
produce termini complementary to those of the DNA homolog. 
The complementary cohesive termini of the vector and the 
DNA homolog are then operatively linked (ligated) to 
15 produce a unitary double stranded DNA molecule. 

In preferred embodiments, the V H -coding and V L -coding 
DNA homologs of diverse libraries are randomly combined in 
vitro for polycistronic expression from individual 
vectors. That is, a diverse population of double stranded 
20 DNA expression vectors is produced wherein each vector 
expresses, under the control of a single promoter, one V H - 
coding DNA homolog and one V L -coding DNA homolog, the 
diversity of the population being the result of different 
V - and V, -coding DNA homolog combinations . 

H U 

2 5 Random combination in vitro can be accomplished using 

two expression vectors distinguished from one another by 
the location on each of a restriction site common to both. 
Preferably the vectors are linear double stranded DNA, 
such as a Lambda Zap derived vector as described herein. 

3 0 In the first vector, the site is located between a promo- 

ter and a polylinker, i.e., 5 1 terminal (upstream relative 
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jb between a promoter and the restriction site, i.e., the 
restriction site is located 3 1 terminal to the polylinker, 
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10 



In preferred embodiments, each of the vectors defines 
a nucleotide sequence coding for a ribosome binding and a 
leader, the sequence being located between the promoter 
and the poly linker, but downstream (3' terminal) from the 
shared restriction site if that site is between the promo- 
ter and poly linker. Also preferred are vectors containing 
a stop codon downstream from the polylinker, but upstream 
from any shared restriction site if that site is down- 
stream from the polylinker. The first and/or second 
vector can also define a nucleotide sequence coding for a 
peptide tag. The tag sequence is typically located down- 
stream from the polylinker but upstream from any stop 
codon that may be present. 

In preferred embodiments, the vectors contain 
15 selectable markers such that the presence of a portion of 
that vector, i.e. a particular lambda arm, can be selected 
for or selected against. Typical selectable markers are 
well known to those skilled in the art. Examples of such 
markers are antibiotic resistance genes, genetically 
selectable markers, mutation suppressors such as amber 
suppressors and the like. The selectable markers are 
typically located upstream of the promoter and/ or down- 
stream of the second restriction site. In preferred 
embodiments, one selectable marker is located upstream of 
25 the promoter on the first vector containing the V H -coding 
DNA homologs. A second selectable marker is located down- 
stream of the second restriction site on the vector con- 
taining the V L -coding DNA homologs. This second selectable 
marker may be the same or different from the first as long 
30 as when the V H -coding vectors and the V L -coding vectors are 
randomly combined via the first restriction site the 



20 



that defines one or more, preferably at least two, 
restriction sites, each unique to the vector, i.e., if it 



SUBSTITUTE SHEET 



WO 91/16427 



PCT/US91/02910 



70 

The polylinker restriction sites are oriented to permit 
ligation of v H - or V L -coding DNA homologs into the vector 
in same reading frame as any leader, tag or stop codon 
sequence present. 
5 Random combination is accomplished by ligating V H - 

coding DNA homologs into the first vector, typically at a 
restriction site or sites within the poly linker. 
Similarly, V L -coding DNA homologs are ligated into the 
second vector, thereby creating two diverse populations of 
10 expression vectors. It does not matter which type of DNA 
homolog, i.e., V H or V L , is ligated to which vector, but it 
is preferred, for example, that all V H -coding DNA homologs 
are ligated to either the first of second vector, and all 
of the V L -coding DNA homologs are ligated to the other of 
15 the first or second vector. The members of both popula- 
tions are then cleaved with an endonuclease at the shared 
restriction site, typically by digesting both populations 
with same enzyme. The resulting product is two diverse 
populations of restriction fragments where the members of 
20 one have cohesive termini complementary to the cohesive 
termini of the members of the other. The restriction 
fragments of the two populations are randomly ligated to 
one another, i.e., a random, interpopulation ligation is 
performed, to produce a diverse population of vectors each 
25 having a V H -coding and V L -coding DNA homolog located in the 
same reading frame and under the control of second 
vector's promoter. Of course, subsequent recombinations 
can be effected through cleavage at the shared restriction 
site, which is typically reformed upon ligation of members 
3 0 from the two populations, followed by subsequent 
religations. 

rpV> v- o f~t > > 1 * ■ . . . * 

separately or in combination. When coexpressed within the 
same organism, either on the same or the difference 
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10 



V H and V L polypeptides are expressed in different organ- 
isms, the respective polypeptides are isolated and then 
combined in an appropriate medium to form a Fv. Cellular 
hosts into which a V H - and/or V L -coding DNA homolog- 
containing construct has been introduced are referred to 
herein as having been "transformed" or as "transf ormants" . 

The host cell can be either procaryotic or eucary- 
otic. Bacterial cells are preferred procaryotic host 

cells ana nypicany cme a suam ^j. ±±* — ~~~~ — , 

example, the E, coli strain DH5 available from Bethesda 
Research Laboratories, Inc., Bethesda, MD. Preferred 
eucaryotic host cells include yeast and mammalian cells, 
preferably vertebrate cells such as those from a mouse, 
rat, monkey or human cell line* 
15 Transformation of appropriate cell hosts with a 

recombinant DNA molecule of the present invention is 
accomplished by methods that typically depend on the type 
of vector used. With regard to transformation of procary- 
otic host cells, see, for example, Cohen et al., 
20 Proceedings National Acad emy of Science, USA Vol. 69, P. 
2110 (1972); and Maniatis et al., flojecular Cloning , — a 
laboratory manual . Cold Spring Harbor Laboratory, Cold 
Spring Harbor, NY (1982). With regard to the transform- 
ation of vertebrate cells with retroviral vectors 
25 containing rDNAs , see for example, Sorge et al. , Mol^ 
Cell. Biol. , 4:1730-1737 (1984); Graham et al., Virol. , 
52:456 (1973); and Wigler et al., Proceedings National 
Academy of Sciences , USA, Vol. 76, P. 1373-1376 (1979). 



30 



b. Expressing the Dicistro nic DNA Molecules 

The dicistronic DNA molecules produced by the above 



tively link DNA to vectors via complemenetary cohesive 
35 termini. For instance, complementary cohesive teraini can 
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the primer extension reaction by use of an appropriately 
designed polynucleotide synthesis primer, as previously 
discussed. The dicistronic DNA molecule, and vector if 
necessary, is cleaved with a restriction endonuclease to 
5 produce termini complementary to htose of the vector. The 
complementary cohesive termini of the vector and the 
dicistronic DNA molecule are then operatively linked 
(ligated) to produce a unitary double stranded DNA 
molecule. 

10 The present method produces a diverse population of 

double stranded DNA expression vectors wherein each vector 
expresses, under the control of a single promoter, one V H - 
coding DNA homolog and one V L -coding DNA homolog, the 
diversity of the populuation being the result of different 

15 V u - and V, -coding DNA homolog combination that occurs 

n L 

during the PCR reaction where both outside and both inside 
primers are present in effective amounts. Preferably the 
vectors are linear double stranded DNA, such as a Lambda 
Zap derived vector as described herein. 
2 0 In preferred embodiments, the vector defines a 

nucleotide sequence coding for a ribosome binding site and 
a leader, the sequence being located downstream from a 
promoter and upstream from a sequence ocding for apoly- 
peptide leader. In preferred embodiments, the vector 

2 5 contains a selectable marker such that the presence of a 

dicistronic DNA molecule of this invention inserted into 
the vector, can be selected. Typical selectable markers 
are well known to those skilled in the art. Examples of 
such markers are antibiotic resistance genes, genetically 

3 0 selectable markers, mutation suppressors such as amber 

supppressors and the like. The selectable markers are 



sion of the V H - and V L ~coding DNA homologs. When 
coexpressed within the same organism, a functionally 
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Cellular hosts into which a V H - and V L -coding DNA homolog- 
containing constructu has been introduced are referred to 
herein as having been "transformed" or as " trans formants" . 
The host cell can be either prokaryotic or 
5 eukaryotic. Bacterial cells are preferred prokaryotic 
host cells for library screening, and typically are a 
strain of E. coli such as, for example, the E. coli strain 
DH5 available from Bethesda Research Laboratories, Inc. , 
Bethesda, MD. Preferred eukaryotic host cells include 
10 yeast and mammalian cells, preferably vertebrate cells 
such as those from a mouse, rat, monkey or human cell 
line. 

Transformation of appropriate cell hosts with a 
recombinant DNA molecule of the present invention is 

15 accomplished by methods that typically depend on the type 
of vector used. With regard to transformation of prokary- 
otic host cells, see, for example, Cohen et al., Prpg. 
Natl. Acad. Sci. , USA, 69:2110 (1972); and Maniatis et 
al., Molecular Cloning; A Labor atory Manual, Cold Spring 

20 Harbor, NY (1982). With regard to the transformation of 
vertebrate cells with retorviral vectors containing rDNAs , 
see for example, Sorge et al., Mol. Cell . Biol., 4:1730- 
1737 (1984); Graham et al., Virol . , 52:456 (1973); and 
Wigler et al., Proc. Natl. Acad. Sci. , USA, 76:1373-1376 

25 (1979). 

5. Screening For Exp rp^inn of V,, and/or V, Polypeptides 
Successfully transformed cells, i.e., cells contain- 
ing a V H - and/or V L -coding DNA homolog or a dicistromic DNA 
molecule operatively linked to a vector, can be identified 
3 0 by any suitable well known technique for detecting the 
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the binding of ligand by the receptor produces a detect- 
35 able signal, either directly or indirectly. Such signals 
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formation of a catalytic reaction product, the release or 
uptake of energy, and the like. For example, cells from 
a population subjected to transformation with a subject 
rDNA can be cloned to produce monoclonal colonies. Cells 
5 form those colonies can be harvested, lysed and their DNA 
content examined for the presence of the rDNA using a 

method such as that described Southern, J . Mol . Biol . , 

98:503 (1975) or Berent et al., Biotech . 3:208 (1985). 

In addition to directly assaying for the presence of 

10 a V H - and/or V L -coding DNA homolog or a dicistronic DNA 
molecule, successful transformation can be confirmed by 
well known immunological methods, especially when the V H 
and/or V L polypeptides produced contain a preselected 
epitope. For example, samples of cells suspected of being 

15 transformed are assayed for the presence of the 
preselected epitope using an antibody against the epitope. 

6. Vj, - And/Or V L -Codina Gene Libraries 

According to one aspect, the present invention 
contemplates a gene library, preferably produced by a 

20 primer extension reaction or combination of primer 
extension reactions as described herein, containing at 
least about 10 3 , preferably at least about 10 4 and more 
preferably at least about 10 5 different V H - and/or VL- 
coding DNA homologs. The homologs are preferably in an 

25 isolated form, that is, substantially free of materials 
such as, for example, primer extension reaction agents 
and/or substrates, genomic DNA segments, and the like. 

In preferred embodiments, a substantial portion of 
the homologs present in the library are operatively linked 

30 to a vector, preferably operatively linked for expression 
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containing buffering salts, and the like. The medium 
35 should be compatible with maintaining the activity of the 
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a concentration sufficient to allow transformation of a 
host cell compatible therewith at reasonable frequencies. 

It is further preferred that the homologs be present 
in compatible host cells transformed therewith. 

5 C. Expression Vectors 

The present invention also contemplates various 
expression vectors useful in performing/ inter alia, the 
methods of the present invention. Each of the expression 
vectors is a novel derivative of Lambda Zap vector. 

10 1. Lambda Zap II 

Lambda Zap II is prepared by replacing the Lambda S 
gene of the vector Lambda Zap with the Lambda S gene from 
the Lambda gtlO vector, as described in Example 6. 

2 . Lambda Zap II V u 

15 Lambda Zap II V H is prepared by inserting the 

synthetic DNA sequences illustrated in Figure 6A into the 
above-described Lambda Zap II vector. The inserted 
nucleotide sequence advantageously provides a ribosome 
binding site (Shine-Dalgarno sequence) to permit proper 

20 initiation of mRNA translation into protein, and a leader 
sequence to efficiently direct the translated protein to 
the periplasm. The preparation of Lambda Zap II V H is 
described in more detail in Example 9, and its features 
illustrated in Figures 6A and 7. 

2 5 3 . Lambda Zap II V L 

Lambda Zap II V L is prepared as described in Example 
12 by inserting into Lambda Zap II the synthetic DNA 
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4. Lambda Zap II V L II 

Lambda Zap II V L II is prepared as described in 
Example 11 by inserting into Lambda Zap II the synthetic 
DNA sequence illustrated in Figure 10- 

5 5. HCFLP 

HCFLP is prepared as described in Example 20 by 
inserting a flp sequence containing EcoRI compatible ends 
into the ecori site or the lambda zap II v H vector. 

6. LCFLP 

10 LCFLP is prepared as described in Example 20 by 

inserting a flp sequence containing EcoRI compatible ends 
into the EcoRI site of the lambda Zap II V L vector. 

7 . Lambda ImmunoZAP H 

Lambda ImmunoZAP H is prepared by inserting the 
15 synthetic DNA sequences illustrated in Figure 25A into the 
above-described Lambda Zap II vector. The inserted 
nucleotide sequence advantageously provides a ribosome 
binding site (Shine-Dalgarno sequence) to permit proper 
initiation of mRNA translation into protein, and a leader 
20 sequence to effieicntly direct the translated protein to 
the periplasm. The preparation of Lambda ImmunoZAP H is 
described in more detail in Example 28, and its features 
illustrated in Figures [25A] and [26]. 

8. Modified Lambda ImmunoZAP H 

25 Modified Lambda ImmunoZAP H is prepared by inserting 

the modified synthetic DNA sequences illustrated in Figure 
8A into the above-described Lambda ZAP II vector. The 
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9 . Lambda TmmunoZAP L 

Lambda ImmunoZAP L is prepared as described in 
Example 29 by inserting into Lambda ZAP II the synthetic 
DNA sequence illustrated in Figure 6B. Important features 
5 of Lambda ImmunoZAP L are illustrated in Figure 27. 

The above-described vectors are compatible with L 
coli hosts, i^, they can express for secretion into the 
periplasm proteins coded for by genes to which they have 
been operatively linked for expression. 

10 Examples 

The following examples are intended to illustrate, 
but not limit, the scope of the invention. 

1. Phenotvpe Creation 

In order to obtain lambda phage clones with a range 

15 of desired phenotypes, a combinatorial library selection 
system was used to generate a diverse collection of 
clones. This approach utilized two starting populations 
of lambda phage clones which can be restriction digested, 
mixed, ligated, and packaged to form a library of clones 

20 containing DNA sequences from each of the two populations 
of parent phage. The following example outlines the 
method for rapid construction and selection of lambda 
phage clones containing properties from each of the two 
parent phage populations derived from lambda WT (CI857 

25 indl, Sam7) and lambda gtll (SamlOO) . 

Forty micrograms of a population of lambda phage 
derived from wild type lambda (WT) DNA (CI857 Sam7) 
(available from New England Biolabs) was partially 
digested with lambda Hindlll as determined by ethidium 
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derived from lambda gtll DNA (available from Stratagene 
Cloning Systems, San Diego, CA) was digested to completion 
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with a second enzyme BamHI in order to reduce the cloning 
efficiency of the left arm of the gtll phage (Maniatis et 
al. , supra ) . Both phage populations had been amplified 
lytically, which allowed for a relatively high degree of 
5 mutations in the resulting DNA. One microgram of the 
lambda WT DNA was ligated at the Hind III site to 1 to 4 
/ig of the lambda gtll DNA using T4 DNA ligase in a volume 
of less than 20/11, according to Maniatis, et al. , supra . 
The ligation mix was subsequently packaged in lambda phage 

10 packaging extract, Gigapack™ (Stratagene Cloning Systems, 
San Diego, CA) , as described by the manufacturer. 

The packaged phage library contained a mixture of 
many lambda phage constructions. In order to select for 
desired constructions, phenotypic selection was used to 

15 identify those members of the library displaying vigorous 
growth on supE bacterial hosts. As described by Maniatis 
et al . , supra, dilutions of the phage library was plated 
with E. coli C600 cells (Stratagene Cloning Systems, San 
Diego, CA) to generate a lawn of E. coli with isolated 

2 0 lambda plaques. These isolated plaques are result of 

clonal expansion from a single lambda phage clone. Since 
C600 cells are supE, the growth vigor of the individual 
lambda phage clones could be assessed by the size of the 
lambda phage plaque on the E . coli lawn. The parental WT 
25 phage do not form plaques on E . coli C600. At least three 
classes of phage were identified and subsequently categor- 
ized as small, medium, or large plaque size. The large 
plaque size was an indication of vigorous growth on the 
phage lawn, while small plaque size indicated poor growth. 

3 0 This demonstrates selection for the phenotype of the 

S gene based on plaque size. Other pehnotypes could be 
used for selection. 



■ uiJpL' tii_>L> Li<I"iAii j .'uL. i . ■ ^ » -l. nOS ..... I iU iCd t6u ^liciL 

at least one of the large plaque forming clones, L2 , did 
not contain an amber mutation as found in the lambda WT 
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(Sam7) or lambda gtll (C5100) parent phage- One of the 
small plaque phage, S2, contained the left arm of lambda 
WT gene and the right arm of lambda gtll containing the 
SamlOO gene. This SamlOO mutation is known to grow poorly 
5 on supE hosts and is optimal on a supF strain, with no 
growth on a supO host. The remaining library of clones 
displayed several different phenotypes, dictated by the 
diversity of the two starting populations of phage. Some 
clones also exhibited phenotypes that resulted from the 
10 random assortment of two mutant DNA fragments derived from 
just one of the parent DNA molecules. THis illustrates 
the concept that the two genes that give rise to the 
populations of interest need not be on separate DNA 
molecules at the start of the method. 
15 Due to the phenotypic selection applied following the 

ligation and packaging of the phage library, the large 
diversity of these two populations of phage was not com- 
pletely analyzed. However, the range of clones identified 
with alternate S gene phenotypes demonstrated some of this 
20 diversity. The diversity in these two populations of 
lambda phage is believed to be derived from the low level 
of spontaneous mutations which occur through repeated 
rounds of replication required in large scale preparations 
of lambda phage. However, the spontaneous mutations 
2 5 occurring within each of these individual phage popula- 
tions could not generate a collection of lambda phage 
containing characteristics of both parent populations of 
phage. This combinatorial approach, therefore, provides 
a mechanism in which novel constructions can be generated 
that express genes from both parent phage constructions. 



30 



several regions of conserved sequences that flank the V H 
35 domains. For instance, contain substantially conserved 
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to the same primer sequence. Therefore, polynucleotide 
synthesis (amplification) primers that hybridize to the 
conserved sequences and incorporate restriction sites into 
the DNA homolog produced that are suitable for operatively 
5 linking the synthesized DNA fragments to a vector were 
constructed. More specifically, the DNA homologs were 
inserted into lambda Zap II vector (Stratagene Cloning 
System, San Diego, CA) at the Xhol and EcoRI sites* For 
amplification of the V H domains, the 3 1 primer (primer 67 
10 in Table 7) , was designed to be complementary to the mRNA 
in the J H region. In all cases, the 5 f primers (primers 
56-65,, Table 7) were chosen to be complementary to the 
first strand cDNA in the conserved N-terminus region 
(antisense strand) . Initially amplification was performed 
15 with a mixture of 3 2 primers (primer 56, Table 7) that 
were degenerate at five positions. Hybridoma mRNA could 
be amplified with mixed primers, but initial attempts to 
amplify mRNA from spleen yielded variable results. 
Therefore, several alternatives to amplification using the 
20 mixed 5' primers were compared. 

The first alternative was to construct multiple 
unique primers, eight of which are shown in Table 7, 
corresponding to individual members of the mixed primer 
pool. The individual primers 52-64 of Table 7 were 
25 constructed by incorporating either of the two possible 
nucleotides at three of the five degenerate positions. 

The second alternative was to construct a primer 
containing inosine (primer 65, table 7) at four of the 
variable positions based on the published work of 
30 Takahashi, et al., Proc. Natl. Acad. Sci. (U.S.A.) , 
82:1931-1935, (1985) and Ohtsuka et al., J. Biol. Chem. , 

^^O-OrO^—^^^O ' "< O O CT n TiV, 4 — v~ : ~ >■ I- , - *- \- -, . - ~ - , --r r v +- K 

. ~ w - ... ...awwiit;. .,,..'j;ibt j : VBo • - - - ; - 

lions as discussed by Martin et al., Nu. Acids Res . , 
13:8927 (1985) . However, it was not known if the presence 

- ^ - no ^-; n o nuc! pot i icr- voul d result *r ^corporation ~ * 
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unwanted sequences in the cloned V H regions. Therefore, 
inosine was not included at the one position that remains 
in the amplified fragments after the cleavage of the 
restriction sites. As a result, inosine was not in the 

5 cloned insert. 

Additional, V H amplification primers including the 
unique 3 1 primer were designed to be complementary to a 
portion of the first constant region domain of the gamma 
l neavy cnain mRNA (primers 70 and 71, Table 7). These 
10 primers will produce DNA homologs containing polynucleo- 
tides coding for amino acids from the V H and the first 
constant region domains of the heavy chain. These DNA 
homologs can therefore be used to produce Fab fragments 

rather than an F v . 

15 as a control for amplification from spleen or 

hybridoma mRNA, a set of primers hybridizing to a highly 
conserved region within the constant region IgG, heavy 
chain gene were constructed. The 5 1 primer (primer 66, 
Table 7) is complementary to the cDNA in the C H 2 region 

20 whereas the 3' primer (primer 68, Table 7) is complement- 
ary to the mRNA in the C H 3 region. It is believed that no 
mismatches were present between these primers and their 
templates. 

The nucleotide sequences encoding the V L CDRs are 
25 highly variable. However, there are several regions of 
conserved sequences that flank the V L CDR domains including 
the J L , V L framework regions and V L leader/promoter. 
Therefore, amplification primers that hybridize to the 
conserved sequences and incorporate restriction sites that 
30 allowing cloning the amplified fragments into the 
pBluescript SK-vector cut with Nco I and Spel were con- 



35 (primer 70, Table 7) was chosen to be complementary to the 
first strand cDNA in the conserved N-terminus region 
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A second set of amplification primers for amplifica- 
tion of the V L CDR domains the 5 1 primers (primers 73-80 in 
Table 8) were designed to be complementary to the first 
strand cDNA in the conserved N-terminus region. These 
5 primers also introduced a Sac I restriction endonuclease 
site to allow the FLDNA homolog to be cloned into the V L II- 
expression vector. The 3 1 V L amplification primer (primer 
81 in Table 8) was designed to be complementary to the 
luRNA in the u L regions and to introduce the XbaX restzic— 
10 tion endonuclease site required to insert the V L DNA homolog 
into the V L II-expression vector (Figure 8) . 

Additional 3 1 V L amplification primers were designed 
to hybridize to the constant region of either kappa or 
lambda mRNA (primers 82 and 83 in Table 8) . These primers 
15 allow a DNA homolog to be produced containing polynucleo- 
tide sequences coding for constant region amino acids of 
either kappa or lambda chain. These primers make it 
possible to produce an Fab fragment rather than an F v . 

The primers used for amplification of kappa light 
20 chain sequences for construction of Fabs are shown at 
least in Table 8. Amplification with these primers was 
performed in 5 separate reactions, each containing one of 
the 5* primers (primers 75-78, and 84) and one of the 3 ! 
primer (primer 81) has been used to construct F v fragments. 
25 The 5 1 primers contain a Sac I restriction site and the 3' 
primers contain a Xbal restriction site. 

The primers used for amplification of heavy chain Fd 
fragments for construction of Fabs are shown at least in 
Table 7. Amplification was performed in eight separate 
3 0 reactions, each containing one of the 5 1 primers (primers 
57-64) and one of the 3 1 primers (primer 70). The remain- 



positions (primer 66, Table 7, and primers 89 and 90, 
Table 8). The remaining 3 1 primer (primer 86, Table 8) 

v mt h^en vised *rr construe* - F fraar.ontr*. Manv f th'"- r * 
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prixaers incorporate a Xho I site, and 3- primers include 

a Spel restriction site. 

V amplification primers designed to amplify human 
light chain variable regions of both the lambda and kappa 
5 isotypes are also shown in Table 8. 

All primers and synthetic polynucleotides used herein 
and shown on Tables 7-11 were either purchased from 
Research Genetics in Huntsville, Alabama or synthesized on 

; «n.4ai tria. using 

an Applied Biosystems una synui«^^ , 

10 the manufacturer's instructions. 
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3. Production Of A V„ Coding Repertoire Enrjcfred In 
FITC Bin fl* nc F Proteins 

Fluorescein isothiocyanate (FITC) was selected as a 
ligand for receptor binding. It was further decided to 
5 enrich by immunization the immunological gene repertoire, 
i.e., V H - and V L -coding gene repertoires, for genes coding 
for anti-FITC receptors. This was accomplished by linking 
FITC to keyhole limpet hemocyanin (KLH) using the tech- 
niques described in Antibodies A Laboratory Manual, Harlow 

10 and Lowe, eds., Cold Spring Harbor, New York, (1988). 
Briefly, 10.0 milligrams (mg) of keyhole limpet hemocyanin 
and 0.5 mg of FITC were added to 1 ml of buffer containing 
0.1 M sodium carbonate at pH 9.6 and stirred for 18 to 24 
hours at 4 degrees C (4C) . The unbound FITC was removed 

15 by gel filtration through Sephadex G-25. 

The KLH— FITC conjugate was prepared for injunction 
into mice by adding 100 yg of the conjugate to 250 /il of 
phosphate buffered saline (PBS). An equal volume of com- 
plete Freund's adjuvant was added and the entire solution 

20 was emulsified for 5 minutes. A 129 G IX+ mouse was 
injected with 3 00 jxl of the emulsion. Injections were 
given subcutaneous ly at several sites using a 21 gauge 
needle. A second immunization with KLH-FITC was given two 
week later. This injection was prepared as follows: 

2 5 fifty Mg of KLH-FITC were diluted in 2 50 fih of PBS and an 
equal volume of alum was admixed to the KLH-FITC solution. 
The mouse was injected intraperitoneal^ with 500 fil of 
the solution using a 23 gauge needle. One month later the 
mice were given a final injection of 50 _g of the KLH- 

30 FITC conjugate diluted to 200 _L in PBS. This injection 
was given intravenously in the lateral tail vein using a 



Hybridoma PCP 8D11 producing an antibody immuno- 
specific for phosphonate ester was cultured in DMEM media 
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percent fetal calf serum supplemented with penicillin and 
streptomycin. About 5 x 10 8 hybridoma cells were harvested 
and washed twice in phosphate buffered saline. Total 
cellular RNA was prepared from these isolated hybridoma 
5 cells. 

4. Preparation Of A V |r Codina Gene Repertoire 

Total cellular RNA was prepared from the spleen of a 
sincrle mouse immunized with KLH-FITC as described in 
Example 3 using the RNA preparation methods described by 
10 Chomczynski et al., Anal Biochem. , 162:156-159 (1987) 
using the manufacturer's instructions and the RNA isola- 
tion kit produced by stratagene Cloning Systems, La Jolla, 
CA. Briefly, immediately after removing the spleen from 
the immunized mouse, the tissue was homogenized in 10 ml 
15 of a denaturing solution containing 4.0 M guanine isothio- 
cyanate, 0.25 M sodium citrate at pH 7.0, and 0.1 M 2- 
mercaptoethanol using a glass homogenizer. One ml of 
sodium acetate at a concentration of 2 M at pH 4.0 was 
admixed with the homogenized spleen. One ml of phenol 
20 that had been previously saturated with H 2 0 was also 
admixed to the denaturing solution containing the homo- 
genized spleen. Two ml of a chloroform: isoamyl alcohol 
(24:1 v/v) mixture was added to this homogenate. The 
homogenate was mixed vigorously for ten seconds and 
25 maintained on ice for 15 minutes. The homogenate was then 
transferred to a thick-walled 50 ml polypropylene centri- 
fuge tube (Fisher Scientific Company, Pittsburgh, PA) . 
The solution was centrifuged at 10,000 x g for 20 minutes 
at 4 W C. The upper RNA-containing aqueous layer was 
3 0 transferred to a fresh 50 ml polypropylene centrifuge tube 
and mixed with an eoual volume nf i^oDroDvl alcohol Thi^ 

tateu RNA was centriruqea di iu,UUu x g lor twenty minutes 
3 5 at 4°C. The pelleted total cellular RNA was collected and 
dissolved in 3 ml of the denaturina solution described 
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above. Three ml of isopropyl alcohol was added to the 
resuspended total cellular RNA and vigorously mixed. This 
solution was maintained at -20 °C for at least 1 hour to 
precipitate the RNA. The solution containing the precipi- 
5 tated RNA was centrifuged at 10 , 000 x g for ten minutes at 
4°C. The pelleted RNA was washed once with a solution 
containing 75% ethanol. The pelleted RNA was dried under 
vacuum for 15 minutes and then resuspended to dimethyl 

nvmrarbnnafce fDEPC} treated H 2 0 (DEPC-H 2 0) . 

IT J , , 4. 

10 Messenger RNA (mRNA) enriched for sequences contain- 

ing long poly A tracts was prepared from the total 
cellular RNA using methods described in Molecular Cloning 
A Laboratory Manual , Maniatias et al., eds. Cold Spring 
Harbor Laboratory, New York, (1982). Briefly, one half of 
15 the total RNA isolated from a single immunized mouse 
spleen prepared as described above was resuspended in one 
ml of DEPC-H 2 0 and maintained at 65 C C for five minutes. 
One ml of 2x high salt loading buffer consisting of 100 mM 
Tris-HCl, 1M sodium chloride, 2.0 mM disodium ethylene 
20 diamine tetraacetic acid (EDTA) at pH 7.5, and 0.2% sodium 
dodecyl sulfate (SDS) was added to the resuspended RNA and 
the mixture allowed to cool to room temperature. The 
mixture was then applied to an oligo-dT (Collaborative 
Research Type 2 or Type 3) column that was previously 
2 5 prepared by washing the oligo-dT with a solution contain- 
ing 0.1 M sodium hydroxide and 5 mM EDTA and then 
equilibrating the column with DEPC-H 2 0. The eluate was 
collected in a sterile polypropylene tube and reapplied to 
the same column after heating the eluate for 5 minutes at 
65 °C. The oligo dT column was then washed with 2 ml of 
high salt loading buffer consisting of 50 mM Tris-HCl at 
r-^u * c;nn sodium rhlor-idp i ^m FDTA at r>H 7 . *5 and 



30 



ph , . , .Uu mM soaium unionae - mK EDTA ana u.it ^Di>. 
The messenger RNA was eluted from the oligo dT column with 
i^i of buffer consist ina of 10 mM Tris-HCL at dH 7.5, 1 mM 
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EDTA at pH 7.5 and 0.05% SDS. The messenger RNA was puri- 
fied by extracting this solution with phenol/chloroform 
followed by a single extraction with 100% chloroform. The 
messenger RNA was concentrated by ethanol precipitation 
5 and resuspended in DEPC- 1^0. 

The messenger RNA isolated by the above process 
contains a plurality of different V H coding polynucleo- 
tides, i.e., greater than about 10 4 different v H -coding 

genes . 

10 5. Preparation Of A Single V n Coding Polynucleotide 

Polynucleotides coding for a single V H were isolated 
according to Example 4 except total cellular RNA was 
extracted from monoclonal hybridoma cells prepared in 
Example 3. The polynucleotides isolated in this manner 

15 code for a single V H . 

6. DNA Homolog Preparation 

In preparation for PCR amplification, mRNA prepared 
according to the above examples was used as a template for 
cDNA synthesis by a primer extension reaction. In a 

20 typical 50 ul transcription reaction, 5-10 ug of spleen or 
hybridoma mRNA in water was first hybridized (annealed) 
with 500 ng (50.0 pmol) of the 3» V H primer (primer 67, 
Table 7), at 65 °C for five minutes. Subsequently, the 
mixture was adjusted to 1.5 mM dATP, dCTP, dGTP and dTTP, 

25 40 mM Tris-HCl at pH 8.0, 8 mM MgCl 2 , 50 mM NaCl, and 2 mM 
spermidine. Moloney-Murine Leukemia virus Reverse 
transcriptase (Stratagene Cloning Systems), 26 units, was 
added and the solution was maintained for 1 hours at 37 °C. 
PCR amplification was performed in a 100 ul reaction 



the b' V primers (primer 57-65 of Table 7; zoo mM or a 
mixture of dNTP's, 50 mM KCl , 10 mM Tris-HCl pH 8.3, 15 mM 
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10 



reaction mixture was overlaid with mineral oil and sub- 
jected to 40 cycles of amplification. Each amplification 
cycle involved denaturation at 92 °C for 1 minute, anneal- 
ing at 52 °C for 2 minutes and polynucleotide synthesis by 
Primer extension (elongation) at 72 *C for 1.5 minutes. 
The amplified V H -coding DNA homolog containing samples were 
extracted twice with phenol/chloroform, once with chloro- 
form, ethanol precipitated and were stored at -70 'C in 10 
mM Tris-HCl, (pH, 7.5) and 1 mM EDTA. 

Using unique 5 1 primers (57-64, Table 7), efficient 
V H -coding DNA homolog synthesis and amplification from the 
spleen mRNA was achieved as shown in Figure 3, lanes R17- 
R24. The amplified cDNA (V„-coding DNA homolog) is seen as 
a major band of the expected size (360 bp). The intensi- 
15 ties of the amplified V H -coding polynucleotide fragment in 
each reaction appear to be similar, indicating that all of 
these primers are about equally efficient in initiating 
amplification. The yield and quality of the amplification 
with these primers was reproducible. 
20 The primer containing inosine also synthesized ampli- 

fied V H -coding DNA homologs from spleen mRNA reproducibly , 
leading to the production of the expected sized fragment, 
of an intensity similar to that of the other amplified 
cDNAs (Figure 4, lane R16) . This result indicated that 
25 the presence of inosine also permits efficient DNA homolog 
synthesis and amplification. Clearly indicating how 
useful such primers are in generating a plurality of V H - 
coding DNA homologs. Amplification products obtained from 
the constant region primers (primers 66 and 68, Table 7) 
30 were more intense indicating that amplification was more 
efficient, possibly because of a higher degree of homology 



each performed with a different 5' primer. Equal portions 
of the products from each primer extension reaction were 
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mixed and the mixed product was then used to generate a 
library of V H -coding DNA homolog-containing vectors. 

DNA homo logs of the V L were prepared from the purified 
mRNA prepared as described above. In preparation for PCR 
5 amplification, mRNA prepared according to the above 
examples was used as a template for cDNA synthesis. In a 
typical 50 ul transcription reaction, 5-10 ug of spleen or 
hybridoma mRNA in water was first annealed with 300 ng 



adjusted to 1.5 mM dATP, dCTP, dGTP, and dTTP, 40 mM Tris- 
HCL at pH 8.0, 8 mM MgCl 2 , 50 mM NaCl, and 2 mM spermidine. 
Moloney-Murine Leukemia virus reverse transcriptase 
(Stratagene Cloning Systems) , 2 6 units, was added and the 

15 solution was maintained for 1 hour at 37 Q C The PCR 
amplification was performed in a 100 ul reaction contain- 
ing approximately 5 ug of the cDNA/RNA hybrid produced as 
described above, 300 ng of the 3' V L primer (primer 69 of 
Table 7) r 300 ng of the 5 1 V L primer (primer 70 of Table 

20 7) , 200 mM of a mixture of dNTP's, 50 mM KC1, 10 mM Tris- 
HC1 pH 8.3, 15 mM MgCl 2 , 0.1% gelatin and 2 units of Taq 
DNA polymerase. The reaction mixture was overlaid with 
mineral oil and subjected to 40 cycles of amplification. 
Each amplification cycle involved denaturation at 92 *C for 

25 1 minute, annealing at 52 °C for 2 minutes and elongation 
at 72 °C for 1.5 minutes. The amplified samples were 
extracted twice with phenol/chloroform, once with chloro- 
form, ethanol precipitated and were stored at 70 'C in 10 
mM Tris-HCl at pH 7.5 and 1 mM EDTA. 

3 0 7 . Inserting DNA Homologs Into Vectors 



mg/m± Dovine serum albumin (BSA) at 3 7 C were digested 
3 5 with restriction enzymes Xho I (125 units) and EcoR I (10 



10 65 *C for five minutes. 






tin -i r> h p u fi f i e d r 1 ^; cidci rose o e 1 
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ments which required a mixture of the products of the 
amplification reactions, equal volumes (50 ul, 1-10 ug 
concentration) of each reaction mixture were combined 
after amplification but before restriction digestion. 
After gel electrophoresis of the digested PCR amplified 
spleen mRNA, the region of the gel containing DNA frag- 
ments of approximately 350 bps was excised, electroeluted 
into a dialysis membrane, ethanol precipitated and resus- 
pended in 10 mM Tris-HCl pH 7.5 and 1 mM EDTA to a final 
concentration of 10 ng/ul. Equimolar amounts of the 
insert were then ligated overnight at 5 Q C to 1 ug of 
Lambda Zap™ II vector (Stratagene Cloning Systems, La 
Jolla, CA) previously cut by EcoR I and Xho I. A portion 
of the ligation mixture (1 ul) was packaged for 2 hours at 
15 room temperature using Gigapack Gold packaging extract 
(Stratagene Cloning Systems, La Jolla, CA) , and the pack- 
aged material was plated on ILl-blue host cells. The 
Library was determined to consist of 2 x 10 7 V H homologs 
with less than 30% non-recombinant background. 
20 The vector used above, Lambda Zap II is a derivative 

of the original Lambda Zap (ATCC # 40,298) that maintains 
all of the characteristics of the original Lambda Zap 
including 6 unique cloning sites, fusion protein expres- 
sion, and the ability to rapidly excise the insert in the 
25 form of a phagemid (Bluescript SK-) , but lacks the SAM 100 
mutation, allowing growth on many Non-Sup F strains, 
including XLl-Blue. The Lambda Zap II was constructed as 
described in Short et al., Nucleic Acids Res., 16:7583- 
7600, (1988), by replacing the Lambda S gene contained in 
30 a 4254 base pair (bp) DNA fragment produced by digesting 
Lambda Zap with the restriction enzyme Ncol. This 4254 bp 



3 5 restriction enzyme Ncol. The 4254 bp DNA fragment 
isolated from lambda gtlO was ligated into the original 
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cols for such procedures described in Current Protocols in 
Molecular Biology , Ausubel et al., eds., John Wiley and 

Sons, New York, (1987). 

In preparation of cloning a library enriched in V L 
5 sequences, 2 ug of PCR amplified products (2.5 mg/30 ul of 
150 mM NaCl, 8 mM Tris-HCl (pH 7.5), 6 mM MgSo 4 , 1 mM DTT, 
200 mg/ml BSA) were digested with restriction enzymes Nco 
I (30 unites) and Spe I (45 units) at 37 *C for 2 hours. 
The digested PCR amplified products were purified on 1% 
10 agarose gel using standard electroelution technique des- 
cribed in Molecular Cloning A Laboratory Manual, Maniatis 
et al., eds., Cold Spring Harbor, New York, (1982). 
Briefly, after gel electroelution of the digested PCR 
amplified product the region of the gel containing the V L - 
15 coding DNA fragment of the appropriate size was excised, 
electroelution into a dialysis membrane, ethanol precipi- 
tated and resuspended at a final concentration of 10 ng 
per ml in a solution containing 10 mM Tris-HCL at pH 7.5 
and 1 mM EDTA. 

20 An equal molar amount of DNA representing a plurality 

of different V L -coding DNA homologs was ligated to a 
pBluescript SK- phagemid vector that had been previously 
cut with Nco I and Spe I. A portion of the ligation mix- 
ture was transformed using the manufacturer's instructions 

25 into Epicuian Coli XLl-Blue competent cells (Stratagene 
Cloning Systems, La Jolla, CA) . The transformant library 
was determined to consist of 1.2 x 10 3 colony forming 
units/ug of V L homologs with less than 3% non-recombinant 
background . 

30 8 . Sequencing of Plasmids From the V ir Coding cDNA Library 



•• : e i ii d n u I a c t. u r > * ■ . : j s tru^i i r j i s : i d \ agent:- , : * ..... i i m v > l e;.. . 

La Jolla, CA) . Briefly, phage plaques were cored from the 
3 5 agar plates and transferred to sterile microfuge tubes 

■■■ — y: " -r . ruffer -\ : r. i na 5^ r.M Tris-HCT. pH 
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7.5, 100 mM NaCl, 10 mM MgS0 4 , and 0.01% gelatin and 20 uL 

of Chloroform. 

For excisions, 200 ul of the phage stock, 200 ul of 
XLl-Blue cells (A^ = 1.00) and 1 ul of R408 helper phage 
5 (1 x 10 11 pfu/ml) were incubated at 37 'C for 15 minutes. 
The excised plasmids were infected into XLl-Blue cells and 
plated onto LB plates containing ampicillin. Double 
stranded DNA was prepared from the phagemid containing 
cells according to the methods described by Holmes et ai., 
10 Anal. Biochem. , 114:193, (1981). Clones were first 
screened for DNA inserts by restriction digests with 
either Pvu II or Bgl I and clones containing the putative 
V H insert were sequenced using reverse transcriptase 
according to the general method described by Sanger et 

15 al., Proc. Natl. At^d. Sci. ■ USA , 74:5463-5467, (1977) and 

the specific modifications of this method provided in the 
manufacturer's instruction in the AMV reverse transcript- 
ase 35 S-dATP sequencing kit from Stratagene Cloning 
Systems, La Jolla, CA. 

2 0 9. Characterization Of The C loned V„ Repertoire 

The amplified products which had been digested with 
Xho I and EcoR I and cloned into Lambda Zap, resulted in 
a cDNA library with 9.0 x 10 s pfu's. In order to confirm 
that the library consisted of a diverse population of V H - 

25 coding DNA homologs, the N-terminal 120 bases of 18 
clones, selected at random from the library, were excised 
and sequenced (Figure 5) . To determine if the clones were 
of V H gene origin, the cloned sequences were compared with 
known V H sequences and V L sequences. The clones exhibited 

30 from 80 to 90% homology with sequences of known heavy 



r-. . '"Bpnnmaicai _nteres i -..lou 
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et al., 4th ed., U.S. Dept. of Health and Human Sciences, 
35 (1987). This demonstrated that the library was enriched 
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for the desired V H sequence in preference to other 
sequences, such as light chain sequences. 

The diversity of the population was assessed by 
classifying the sequenced clones into predefined subgroups 
5 (Figure 5) . Mouse V H sequences are classified into eleven 
subgroups (Figure 5) . Mouse V H sequences are classified 
into eleven subgroups [I (A,B,) , II (A,B,C), III (A r B,C,D( 
V (A,B)] based on framework amino acid sequences described 
in Sequences of proteins of Immunological interest by 
10 Kabot et al., 4th ed., U.S. Dept. of Health and Human 
Sciences, (1987); Dildrop, Immunology Today . 5:84 f (1984); 
and Brodeur et al., Eur. J. Immunol. . 14:922, (1984). 
Classification of the sequenced clones demonstrated that 
the cDNA library contained V H sequences of at least 7 
15 different subgroups. Further, a pairwise comparison of 
the homology between the sequenced clones showed that no 
two sequences were identical at all positions, suggesting 
that the population is diverse to the extent that it is 
possible to characterize by sequence analysis. 
20 Six of the clones (L 36-50, Figure 5) belong to the 

subclass III B and had very similar nucleotide sequences. 
This may reflect a preponderance of mRNA derived from one 
or several related variable genes in stimulated spleen, 
but the data does not permit ruling out the possibility of 
25 a bias in the amplification process. 

10 . V n-Expression Vector Construction 

The main criterion used in choosing a vector system 
was the necessity of generating the largest number of Fab 
fragments which could be screened directly. Bacteriophage 

3 0 lambda was selected as the expression vector for three 

" ■ ' ■ ■•. . * . k - ■ 

t . -:^uiiu ,.jsb>;. : ■ lt-iLfcu;. . r-n.ej.. - a d Ten- 

sion at the level of single phage plaques. Finally, the 
35 screening of phage libraries typically involve less 

"' : r r ■ ■. i f- i i ; i:onf'>t)or i *~ i \* i nd inci , A r ■? ] torn r*. *" ' v r c : 
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plasmid cloning vectors, are only advantageous in the 
analysis of clones after they have been identified. This 
advantage is not lost in the present system because of the 
use of lambda Zap, thereby permitting a plasmid containing 
5 the heavy chain, light chain, or Fab expressing inserts 

to be excised. 

To express the plurality of V H -coding DNA homologs in 
an E. coli host cell, a vector was constructed that placed 
the V H -coding DNA homologs in the proper reading frame, 

10 provided a ribosome binding site as described by Shine et 
al., Nature, 254:34, 1975, provided a leader sequence 
directing the expressed protein to the periplasmic space, 
provided a polynucleotide sequence that coded for a known 
epitope (epitope tag) and also provided a polynucleotide 

15 that coded for a spacer protein between the V H -coding DNA 
homolog and the polynucleotide coding for the epitope tag. 
A synthetic DNA sequence containing all of the above 
polynucleotides and features was constructed by designing 
single stranded polynucleotide segments of 20-40 bases 

20 that would hybridize to each other and form the double 
stranded synthetic DNA sequence shown in Figure 6. The 
individual single-stranded polynucleotides (N1-N12) are 

shown in Table 9. 

Polynucleotides N2, N3, N9-4 ' , Nil, N10-5', N6 , N7 

25 and N8 were kinased by adding 1 ul of each polynucleotide 
(0.1 ug/ul) and 20 units of T4 polynucleotide kinase to a 
solution containing 70 raM Tris-HCl at pH 7.6, 10 mM MgCl 2 
5 mM DTT, 10 mM 2-mercaptoethanol (2ME) , 500 micrograms 
per ml of BSA. The solution was maintained at 37 "C for 30 

30 minutes and the reaction stopped by maintaining the solu- 
tion at 65 'C for 10 minutes. The two end polynucleotides 



3 5 HC1 at pH 7.4, 2.0 mM MgCl 2 
tion was heated to 70 "C for 

.-• * :-v:,v<": • it : ■ ■ r • " 



and 50.0 mM NaCl . This solu- 
5 minutes and allowed to cool 
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a 500 ml beaker of water. During this time period all 10 
polynucleotides annealed to form the double stranded 
synthetic DNA insert shown in Figure 6A. The individual 
polynucleotides were covalently linked to each other to 
5 stabilize the synthetic DNA insert by adding 40 ul of the 
above reaction to a solution containing 50 mM Tris-HCl at 
pH 7.5, 7 mM MgCl 2 , 1 mM DTT, 1 mM adenosine triphosphate 
(ATP) and 10 units of T4 DNA ligase. This solution was 
maintained at 37 *C for 3 0 minutes and then the T4 DNA 
10 ligase was inactivated by maintaining the solution at 65 *C 
for 10 minutes. The end polynucleotides were kinased by 
mixing 52 ul of the above reaction, 4 ul of a solution 
containing 10 mM ATP and 5 units of T4 polynucleotide 
kinase. This solution was maintained at 37 # C for 3 0 
15 minutes. The completed synthetic DNA insert was ligated 
directly into a lambda Zap II vector that had been previ- 
ously digested with the restriction enzymes Not I and Xho 
I. The ligation mixture was packaged according to the 
manufacture's instructions using Gigapack II Gold packing 
20 extract available from Stratagene Cloning Systems, La 
Jolla, CA. The packaged ligation mixture was plated on 
XL1 blue cells (Stratagene Cloning Systems, San Diego, 
CA) . Individual lambda Zap II plaques were cored and the 
inserts excised according to the in vivo excision protocol 
25 provided by the manufacturer, Stratagene Cloning Systems, 
La Jolla, CA. This in vivo excision protocol moves the 
cloned insert from the lambda Zap II vector into a plasmid 
vector to allow easy manipulation and sequencing. The 
accuracy of the above cloning steps was confirmed by 
30 sequencing the insert using the Sanger dideoxide method 
described in by Sanger et al., Proc. Natl. Acad. Sci USA , 



J .J 



sequence 01 the resulting V H expression vector is snown in 
Figure 6A and Figure 7 . 
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Table 9 

(91) Nl) 

(92) N2) 

(93) N3) 

(94) N4) 

(95) N5) 

(96) N6) 

(97) N7) 

(98) N8) 

(99) N9-4) 

(100) Nil) 

(101) N12) 

(102) N10- 



5 • GGCCGCAAATTCTATTTCAAGGAGACAGTCAT 3 ■ 
5' AATGAAATACCTATTGCCTACGGCAGCCGCTGGATT 3» 
5« GTTATTACTCGCTGCCCAACCAGCCATGGCCC 3' 
5' AGGTGAAACTGCTCGAGAATTCTAGACTAGGTTAATAG 3' 
5» TCGACTATTAACTAGTCTAGAATTCTCGAG 3' 
5 1 CAGTTTCACCTGGGCCATGGCTGGTTGGG 3" 
5» CAGCGAGTAATAACAATCCAGCGGCTGCCGTAGGCAATAG 3' 
b • GTAT M T"TCAT"TATGACTGTCTCCTTGAAATAG 3 - 

5 ' AGGTGAAACTGCTCGAGATTTCTAGACTAGTTACCCGTAC 3 1 
5' GACGTTCCGGACTACGGTTCTTAATAGAATTCG 3» 
5 1 TCGACGAATTCTATTAAGAACCGTAGTC 3 1 
5) 5 CGGAACGTCGTACGGGTAACTAGTCTAGAAATCTCGAG 3» 



30 



11. V L Expression Vector Co nstruction 

To express the plurality of V L coding polynucleotides 
in an E . coli host cell, a vector was constructed that 
placed the V L coding polynucleotide in the proper reading 
frame, provided a ribosome binding site as described by 
Shine et al., Nature , 254:34, (1975), provided a leader 
sequence directing the expressed protein to the piro- 
plasmic space and also provided a polynucleotide that 
coded for a spacer protein between the V L polynucleotide 
and the polynucleotide coding for the epitope tag. A 
synthetic DNA sequence containing all of the above poly- 
nucleotides and features was constructed by designing 
single stranded polynucleotide segments of 20-40 bases 
that would hybridize to each other and form the double 
stranded synthetic DNA sequence shown in Figure 6B. The 
individual single-stranded polynucleotides (N1-N8) are 
shown in Table 9, 



35 



': i u iv nucleoli 



p <~~> 



i ..' 4 I L. _i -i. i I ill" 



Tris-HCL at pH 7.6, 10 mM MgCl 2 , 5 mM DTT, 10 mM 2 ME , 500 
micrograms per ml of BSA. The solution was maintained at 
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ing the solution at 65 *C for 10 minutes. The two end 
polynucleotides 20 ng of polynucleotides Nl and poly- 
nucleotides N5 were added to the above kinasing reaction 
solution together with 1/10 volume of a solution contain- 
5 ing 20.0 mM Tris-HCL at pH 7.4, 2.0 mM MgCl 2 and 50.0 raM 
NaCl. This solution was heated to 70 # C for 5 minutes and 
allowed to cool to room temperature, approximately 25* C, 
over 1.5 hours in a 500 ml beaker of water. During this 
time period all the polynucleotides annealed to form the 
10 double stranded synthetic DNA insert. The individual 
polynucleotides were covalently linked to each other to 
stabilize the synthetic DNA insert with adding 40 ul of 
the above reaction to a solution containing 50 ul Tris- 
HCL at pH 7.5, 7 mM MgCl 2 , 1 mM DTT, 1 mM ATP and 10 units 
15 to T4 DNA ligase. This solution was maintained at 37 °C 
for 30 minutes and then the T4 DNA ligase was inactivated 
by maintaining the solution at 65 'C for 10 minutes. The 
end polynucleotides were kinased by mixing 52 ul of the 
above reaction, 4 ul of a solution recontaining 10 mM ATP 
20 and 5 units of T4 polynucleotide kinase. This solution 
was maintained at 37 *C for 30 minutes and then the T4 
polynucleotide kinase was inactivated by maintaining the 
solution at 65 Q C for 10 minutes. The completed synthetic 
DNA insert was ligated directly into a lambda Zap II 
2 5 vector that had been previously digested with the restric- 
tion enzymes Not I and Xho I. The ligation mixture was 
packaged according to the manufacture's instructions using 
Gigapack II Gold packing extract available from stratagene 
Cloning Systems, La Jolla, CA. The packaged ligation 
30 mixture was plated on XLl-Blue cells (Stratagene Cloning 
Systems, La Jolla, CA) . Individual lambda Zap II plagues 
were cored and the inserts pyri se^ armrdina to thp 1 n 

jnoiL el di., Nucleic Acias Res. , _ls : / d8 3 - / 600 ^1988). 
This in vivo excision protocol moves the cloned insert 
from the lambda Zap IT vector into a nhaaemid vector to 
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allow easy manipulation and sequencing and also produces 
the phagemid version of the V L expression vectors. The 
accuracy of the above cloning steps was confirmed by 
sequencing the insert using the Sanger dideoxide method 
5 described by Sanger et al., Proc. Na tl. Acad. Aci. USA, 
74:5463-5467, (1977) and using the manufacturer's instruc- 
tions in the AMV reverse transcriptase 35 S-dATP sequencing 
kit from Stratagene Cloning Systems, La Jolla, CA. The 
sequence of the resulting V L expression vector is shown in 

10 Figure 6 and Figure 8. 

The V L expression vector used to construct the V L 
library was the phagemid produced to allow the DNA of the 
V L expression vector to be determined. The phagemid was 
produced, as detailed above, by the in vivo excision 

15 process from the Lambda Zap V L expression vector (Figure 
8) . The phagemid version of this vector was used because 
the Nco I restriction enzyme site is unique in this 
version and thus could be used to operatively linked the 
V L DNA homologs into the expression vector. 

20 12. V L II-Expression Vector Construction 

To express the plurality of V L -coding DNA homologs in 
an E. coli host cell, a vector was constructed that placed 
the V L -coding DNA homologs in the proper reading frame, 
provided a ribosome binding site as described by Shine et 

25 al., Nature , 254:34, 1975, provided the Pel B gene leader 
sequence that has been previously used to successfully 
secrete Fab fragments in E. coli by Lei et al., J_. — Bac. , 
169:4379 (1987) and Better et al., Science , 240:1041 
(1988) , and also provided a polynucleotide containing a 

3 0 restriction endonuclease site for cloning. A synthetic 

m>t> _ n ^ 10T . _ ,„4~ : - ,* - r U ,v - -. *■ 

... ^ > nuoi u o l .... u k r, eqmem,. - • * » :msb:. i i a o u jl . ■ v u i - '■■ 1 

lze to each other and form the double stranded synthetic 
3 5 DNA sequence shown in Figure 10. The sequence of each 

: ri?v ; iua1 -indie-stranded polynucl o~t i der, '0V09 ,! ^ithir. 
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the double stranded synthetic DNA sequence is shown in 
Table 10. 

Polynucleotides 02, 03, 04, 05, 06 and 07 were 
kinased by adding 1 ul (0-1 ug/ul) of each polynucleotide 
5 and 20 units of T4 polynucleotide kinase to a solution 
containing 70 mM Tris-HCL at pH 7.6, 10 mM magnesium 
chloride (MgCl) , 5 mM dithiothreitol (DTT) , 10 mM 2- 
mercaptoethanol (2ME) , 500 micrograms per ml of bovine 
serum albumin. The solution was maintained at 37 *c for 30 
10 minutes and the reaction stopped by maintaining the 
solution at 65 *C for 10 minutes. The 20 ng each of the 
two end polynucleotides, 01 and 08, were added to the 
above kinasing reaction solution together with 1/10 volume 
of a solution containing 20.0 mM Tris-HCl at pH 7.4, 2.0 
15 mM MgCl and 15.0 mM sodium chloride (NaCl) . This solution 
was heated to 70 *C for 5 minutes and allowed to cool to 
room temperature, approximately 25° C, over 1.5 hours in a 
500 ml beaker of water. During this time period all 8 
polynucleotides annealed to form the double stranded 
20 synthetic DNA insert shown in Figure 9. The individual 
polynucleotides were covalently linked to each other to 
stabilize the synthetic DNA insert by adding 40 ul of the 
above reaction to a solution containing 50 ml Tris-HCl at 
pH 7.5, 7 ml MgCl, 1 mm DTT, 1 mm ATP and 10 units of T4 
25 DNA ligase. This solution was maintained at 37 °C for 3 0 
minutes and then the T4 DNA ligase was inactivated by 
maintaining the solution at 65 # C for 10 minutes. The end 
polynucleotides were kinased by mixing 52 ul of the above 
reaction, 4 ul of a solution containing 10 mM ATP and 5 
3 0 units of T4 polynucleotide kinase. This solution was 
maintained at 37 W C for 3 0 minutes and then the T4 poly- 
nucleotide kinase was inactivated bv maintain! na thp 

... s,.- 

cic; unau naa oeen previously digested witn tne restric- 
tion enzymes Not I and Xho I. The ligation mixture was 
packaged according to the manufacturer's instructions 
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using Gigapack II Gold packing extract available from 
Stratagene Cloning Systems, La Jolla, CA. The packaged 
ligation mixture was plated on XL1 blue cells (Stratagene 
Cloning Systems, San Diego, CA) . Individual lambda Zap II 
5 plaques were cored and the inserts excised according to 
the in vivo excision protocol provided by the manufac- 
turer, Stratagene Cloning Systems, La Jolla, CA. This xn 
vivo excision protocol moves the cloned insert from the 
lambda Zap II vector into a plasmid vector to allow easy 

10 manipulation and sequencing. The accuracy of the above 
cloning steps was confirmed by sequencing the insert using 
the manufacturer's instructions in the AMV Reverse 
Transcriptase 35 S-dATP sequencing kit from Stratagene 
Cloning Systems, La Jolla, CA. The sequence of the 

15 resulting V L II-expression vector is shown in Figure 9 and 
Figure 11. 



Table 10 

(102) 01) 5' TGAATTCTAAACTAGTCGCCAAGGAGACAGTCAT 3' 

(103) 02) 5' AATGAAATACCTATTGCCTACGGCAGCCGCTGGATT 3' 
20 (104) 03) 5* GTTATTACTCGCTGCCCAACCAGCCATGGCC 3' 

(105) 04) 5' GAGCTCGTCAGTTCTAGAGTTAAGCGGCCG 3' 

(106) 05) 5' 

GTATTTCATTATGACTGTCTCCTTGGCGACTAGTTTAGAATTCAAGCT 

3 ' 

25 (107) 06) 5' CAGCGAGTAATAACAATCCAGCGGCTGCCGTAGGCAATAG 3' 

(108) 07) 5' TGACGAGCTCGGCCATGGCTGGTTGGG 3' 

(109) 08) 5' TCGACGGCCGCTTAACTCTAGAAC 3' 

13 - v^- v L Library C onstruction 

To prepare an expression library enriched in V H 



These nomoiogs were then digested with the restriction 
enzymes Xho I and Spe I and purified on a 1% agarose gel 

^ur, -.HiH oTnctroelution f *»chni an«=> described ir 
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Molecular Cloning A Laboratory Manual . Maniatis et al., 
eds., Cold Spring Harbor, New York, (1982) . These 
prepared V H DNA homologs were then directly inserted into 
the V H expression vector that had been previously digested 
5 with Xho I and Spe I. 

The ligation mixture containing the V H DNA homologs 
were packaged according to the manufacturers specifica- 
tions using Gigapack Gold II Packing Extract (Stratagene 
Cloning Systems, La Jolla, CA) . The expression libraries 

10 were then ready to be plated on XL-1 Blue cells. 

To prepare a library enriched in V L sequences, PCR 
amplified products enriched in V L sequences were prepared 
according to Example 7. The V L DNA homologs were digested 
with restriction enzymes Nco I and Spe I. The digested V L 

15 DNA homologs were purified on a 1% agarose gel using 
standard electrelusion techniques described in Molecular 
Cloning A Laboratory Manual , Maniatis et al., eds., Cold 
Spring Harbor, NY (1982). The prepared V L DNA homologs 
were directly inserted into the V L expression vector that 

20 had been previously digested with the restriction enzymes 
Nco I and Spe I. The ligation mixture containing the V L 
DNA homologs were transformed into XL-1 blue competent 
cells using the manufacturer's instructions (Stratagene 
Cloning Systems, La Jolla, CA) . 

25 14. Inserting V L Coding DNA Homologs Into V L Expression 
Vector 

In preparation for cloning a library enriched in V L 
sequences, PCR amplified products (2.5 ug/30 ul of 150 mM 
NaCl, 8 mM Tris-HCl (pH 7.5), 6 mM MgCl 4 , 1 mM DTT, 200 
30 ug/ml BSA at 37 °C were digested with restriction enzymes 
Sac I (125 units) and Xba I (125 units) and purified on a 



35 reaction mixture were combined after amplification but 
before restriction digestion. After gel electrophoresis 
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10 



of the digested PCR amplified spleen mRNA, the region of 
the gel containing DNA fragments of approximately 350 bps 
was excised, electroeluted into a dialysis membrane, 
ethanol precipitated and resuspended in a TE solution 
containing 10 mM Tris-HCl pH 7.5 and 1 iM EDTA to a final 
concentration of 50 ng/ul. 

The V L II-expression DNA vector was prepared for 
cloning by admixing 100 ug of this DNA to a solution 
containing 250 units each of the restriction endonucleases 
Sac 1 and Xba 1 (both from Boehringer Mannheim, 
Indianapolis, IN) and a buffer recommended by the manu- 
facturer. This solution was maintained at 37 *C for 1.5 
hours. The solution was heated at 65 'C for 15 minutes to 
inactivate the restriction endonucleases. The solution 
15 was chilled to 30 'C and 25 units of heat-killable (HK) 
phosphatase (Epicenter, Madison, WI) and CaCl 2 were admixed 
to it according to the manufacturer's specifications. 
This solution was maintained at 30' C for 1 hour. The DNA 
was purified by extracting the solution with a mixture of 
20 phenol and chloroform followed by ethanol precipitation. 
The V L II expression vector was now ready for ligation to 
the V, DNA homologs prepared in the above examples. 

DNA homolog enriched in V L sequences were prepared 
according to Example 6 but using a 5* light chain primer 
25 and 3- light chain primer shown in Table 9. Individual 
amplification reactions were carried out using each 5' 
light chain primer in combination with the 3- light chain 
primer. These separate V L homolog-containing reaction 
mixtures were mixed and digested with the restriction 
endonucleases Sac 1 and Xba 1 according to Example 7. The 
V L homologs were purified on a 1% agarose gel using the 



30 



rnrv: M.irnn- 'Ipw -nr. H / r»ese : reparei: 'NA 

3 5 homologs were then directly inserted into the Sac 1 - Xba 
cleaved V L II-expression vector that was prepared above by 
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of the V L II-expression vector overnight at 5 # C. 3.0 x 10 5 
plaque forming units were obtained after packaging the DNA 
with Gigapack II Bold (Stratagene Cloning Systems, La 
Jolla, CA) and 50% were recombinants. 

5 15 . Randomly Combining V„ and V L DNA Homoloas on the Same 
Expression Vector 

The V L II-expression library prepared in Example 13 was 
amplified and 500 ug of V L II-sxpression library phage DNA 
prepared from the amplified phage stock using the proce- 

10 dures described in Molecular Cloning: A Laboratory 
Manual , Maniatis et al., eds., Cold Spring Harbor 
Laboratory, Cold Spring Harbor, NY (1982) , 50 ug of this 
V L II-expression library phage DNA was maintained in a 
solution containing 100 units of Lul restriction endo- 

15 nuclease (Boehringer Mannheim, Indianapolis, IN) in 200 ul 
of a buffer supplied by the endonuclease manufacturer for 
1.5 hours at 37 *C„ The solution was then extracted with 
a mixture of phenol and chloroform. The DNA was then 
ethanol precipitated and resuspended in 100 ul of water. 

20 This solution was admixed with 100 units of the restric- 
tion endonuclease EcoR I (Boehringer Mannheim, 
Indianapolis, IN) in a final volume of 200 ul of buffer 
containing the components specified by the manufacturer. 
This solution was maintained at 37 # C for 1.5 hours and the 

25 solution was then extracted with a mixture of phenol and 
chloroform. The DNA was ethanol precipitated and the DNA 
resuspended in TE. 

The V H expression library prepared in Example 13 was 
amplified and 500 ug of V H expression library phage DNA 

3 0 prepared using the methods detailed above. 50 ug of the 
V exDress i on 1 ibr^TV nb^a^ nxj* v**- "n^^->^n^ : - "» ■ ■ 

ui a cutter supplied by the endonuclease manufacturer for 
3 5 1.5 hours at 37°C. The solution was then extracted with 

a mixture of phenol and chloroform saturated with n - 1 
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Tris-HCL at pH 7.5. The DNA was then ethanol precipitated 
and resuspended in 100 ul of water. This solution was 
admixed with 100 units of the restriction endonuclease 
EcoR I (Boehringer Mannheim, Indianapolis, IN) in a final 
5 volume of 200 ul of buffer containing the components 
specified by the manufacturer. This solution was main- 
tained at 37 # C for 1.5 hours and the solution was then 
extracted with a mixture of phenol and chloroform. The 
DNA was ethanol precipitated and the DNA resuspended in 
10 TE. 

The restriction digested V H and V L II-expression 
Libraries were ligated together. The ligation reaction 
consisted of 1 ug of V H and 1 ug of V L II phage library DNA 
in a 10 ul reaction using the reagents supplied in a liga- 

15 tion kit purchased from Stratagene Cloning Systems (La 
Jolla, CA). After ligation for 16 hr at 4*C, l ul of the 
ligated the phage DNA was packaged with Gigapack Gold II 
packaging extract and plated on XL 1-blue cells prepared 
according the manufacturer's instructions. A portion of 

20 the 3X10 6 clones obtained were used to determine the 
effectiveness of the combination. The resulting V H and V L 
expression vector is shown in Figure 11. 

Clones containing both V H and V L were excised from the 
phage to pBluescript using the in vitro excision protocol 

25 described by Short et al., Nucleic A cid Research, 16L7583- 
7600 (1988). Clones chosen for excision expressed the 
decapeptide tag and did not cleave X-gal in the presence 
of 2mM IPTG , thus remaining white. Clones with these 
characteristics represented 30% of the library. 50% of 

30 the clones chosen for excision contained a V H and V L as 
determined by restriction analysis. Since approximately 
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35 than 15% of the clones in the combined library would 
contain both V H and V L clones. The actual number obtained 
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was 15% of the library indicating that the process of 
combination was very efficient. 

16. Segregating DNA Homoloas For a V„ Antigen Binding 
Protein 

5 To segregate the individual clones containing DNA 

homologs that code for a V H antigen binding protein, the 
titre of the V H expression library prepared according to 

Example 12 WaS ucucj.iuj.ucu. xj.wj.aj.jr w.*.wai 

performed using methods well known to one skilled in the 
10 art. Briefly, serial dilutions of the library were made 
into a buffer containing 100 mM NaCl, 50 mM Tris-HCL at 
pH 7.5 and 10 mM MgCl 4 , 5 g/L yeast extract, 10 g/L NZ 
amine (casein hydrolysate) and 0.7% melted, 50 *C agarose. 
The phage, the bacteria and the top agar were mixed and 
15 then evenly distributed across the surface of a prewarmed 
bacterial agar plate (5 g/L NaCl, 2 g/L MgCl 4 , 5 g/L yeast 
extract, 10 g/L NZ amine (casein hydrolysate) and 15 g/L 
Difco agar. The plates were maintained at 37 # C for 12 to 
24 hours during which time period the lambda plaques 
2 0 developed on the bacterial lawn. The lambda plaques were 
counted to determine the total number of plaque forming 
units per ml in the original library. 

The titred expression library was then plated out so 
that replica filters could be made from the library. The 

2 5 replica filters will be used to later segregate out the 

individual clones in the library that are expressing the 
antigens binding proteins of interest. Briefly, a volume 
of the titred library that would yield 20,000 plaques per 
150 millimeter plate was added to 600 ul of exponentially 

3 0 growing E. coli cells and maintained at 37 °C for 15 min- 



the entire mixture distributed evenly across the surface 
3 5 of a prewarmed bacterial agar plate. This process was 

^ nnP1< .^ ^ 0 >~ * sufficient number of plates plate out a 
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total number of plaques at least equal to the library 
size. These plates were then maintained at 37 # C for 5 
hours. The plates were then overlaid with nitrocellulose 
filters that had been pretreated with a solution contain- 
5 ing 10 mM isopropyl-beta-D-thiogalactopyanoside (IPTG) and 
maintained at 37 "C for 4 hours. The orientation of the 
nitrocellulose filters in relation to the plate were 
marked by punching a hole with a needle dipped in water- 
proof ink through the filter and into the bacterial plates 

10 at several locations. The nitrocellulose filters were 
removed with forceps and washed once in a TBST solution 
containing 20 mM Tris-HCl at pH 7.5, 150 mM NaCl and 0.05% 
monolaurate (tween-20) . A second nitrocellulose filter 
that had also been soaked in a solution containing 10 mM 

15 IPTG was reapplied to the bacterial plates to produce 
duplicate filters. The filters were further washed in a 
fresh solution of TBST for 15 minutes. Filters were then 
placed in a blocking solution consisting 20 mM Tris-HCl at 
pH 7.5, 150 mM NaCl and 1% BSA and agitated for 1 hour at 

20 room temperature. The nitrocellulose filters were trans- 
ferred to a fresh blocking solution containing a 1 to 500 
dilution of the primary antibody and gently agitated for 
at least 1 hour at room temperature • After the filters 
were agitated in the solution containing the primary 

25 antibody the filters were washed 3 to 5 times in TBST for 
5 minutes each time to remove any of the residual unbound 
primary antibody. The filters were transferred into a 
solution containing fresh blocking solution and a 1 to 500 
to a 1 to 1,000 dilution of alkaline phosphatase conju- 

30 gated secondary antibody. The filters were gently 
agitated in the solution for at least 1 hour at room 



m n v k : t i \ f 1 s l a u a . n d o u n c 
3 5 filters were washed once in 
Tris-HCl at pH 7.5 and 150 



econaar 1 :;r. moav nt- 
a solution containing 20 mM 
mM NaCl. The filters were 
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from them with filter paper. The color was developed by 
placing the filter in a solution containing 100 mM Tris- 
HC1 at pH 9.5, 100 mM NaCl, 5 mM MgCl 2 , 0.3 mg/ml of nitro 
Blue Tetrazolium (NBT) and 0.15 mg/ml of 5-bromo-4-chloro- 
5 3 -indolyl -phosphate (BCIP) for at least 30 minutes at room 
temperature. The residual color development solution was 
rinsed from the filter with a solution containing 20 mM 
Tris-HCl at pH 7.5 and 150 mM NaCl. The filter was then 
placed in a stop solution consisting of 20 mM Tris-HCl at 
10 pH 2.9 and 1 mM EDTA. The development of an intense 
purple color indicates a positive result. The filters are 
used to locate the phage plaque that produced the desired 
protein. That phage plaque is segregated and then grown 
up for further analysis. 
15 several different combinations of primary antibodies 

and second antibodies were used. The first combination 
used a primary antibody immunospecif ic for a decapeptide 
that will be expressed only if the V H antigen binding 
protein is expressed in the proper reading frame to allow 
20 read through translation to include the decapeptide epi- 
tope covalently attached to the V H antigen binding protein. 
This decapeptide epitope and an antibody immunospecif ic 
for this decapeptide epitope was described by Green et 
al., Cell 28:477 (1982) and Niman et al. , Proc. Nat, ftqad. 
25 Sci. U.S.A. 80:4949 (1983). The sequence of the decapep- 
tide recognized is shown in Figure 11. A functional 
equivalent of the monoclonal antibody that is immuno- 
specif ic for the decapeptide can be prepared according to 
the methods of Green et al. and Niman et al. The secon- 
3 0 dary antibody used with this primary antibody was a goat 
antimouse IgG (Fisher Scientific) . This antibody in 

I'wwiTinsnprifir ^n>- +-h^ q+->~^nd t~pcH nn ™ouf;p TaO and did 

aary antibodies when usea according to tne aoove protocol 
determined that between 25% and 30% of the clones were 
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expressing the decapeptide and therefore these clones were 
assumed to also be expressing a V H antigen binding protein. 

In another combination the anti-decapeptide mouse 
monoclonal was used as the primary antibody and an affin- 
ity purified goat anti-mouse Ig, commercially available as 
part of the picoBlue immunoscreening kit from Stratagene 
Cloning System, La Jolla, CA, was use as the secondary 
antibody. This combination resulted in a large number of 
false positive clones because the secondary antibody also 
immunoreacted with the V H of the heavy chain. Therefore 
this antibody reacted with all clones expressing any V H 
protein and this combination of primary and secondary 
antibodies did not specifically detect clones with the V H 
polynucleotide in the proper reading frame and thus 
15 allowing expressing of the decapeptide. 

Several combinations of primary and secondary anti- 
bodies are used where the primary antibody is conjugated 
to fluorescein isothiocyanate (FITC) and thus the immuno- 
specif icity of the antibody was not important because the 
20 antibody is conjugated to the preselected antigen (FITC) 
and it is that antigen that should be bound by the V H 
antigen binding proteins produced by the clones in the 
expression library. After this primary antibody has bound 
by virtue that is FITC conjugated mouse monoclonal anti- 
2 5 body p2 5764 (ATCC #HB-9505) . The secondary antibody used 
with this primary antibody is a goat anti-mouse Ig 6 (Fisher 
Scientific, Pittsburgh, PA) conjugated to alkaline phos- 
phatase using the method described in Antibodies; A 

Laboratory Manual , Harlow and Lowe, eds., Cold Spring 
30 Harbor, New York, (1988). If a particular clone in the V H 
expression, library, expresses a V H binding protein that 



3 5 to form. 

The second combination of antibodies of the type uses 
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human IgG (Fisher Scientific, Pittsburgh, PA) - The secon- 
dary antibody used with this primary antibody is a goat 
anti-rabbit IgG conjugated to alkaline phosphatase using 
the methods described in Antibodie s A Laboratory Manual. 
5 Harlow and Lane, eds., Cold Spring Harbor, New York, 
(1988) . If a particular clone in the V H expression library 
expresses a V H binding protein that binds the FITC conju- 
gated to the primary antibody, the secondary antibody 
binds specifically and when developed the alkaline 
10 phosphatase causes a distinct purple color to form. 

Another primary antibody was the mouse monoclonal 
antibody p2 5764 (ATCC # HB-9505) conjugated to both FITC 
and 125 I. The antibody would be bound by any V H antigen 
binding proteins expressed. Then because the antibody is 
15 also labeled with 125 I, an autoradiogram of the filter is 
made instead of using a secondary antibody that is conju- 
gated to alkaline phosphatase. This direct production of 
an autoradiogram allows segregation of the clones in the 
library expressing a V H antigen binding protein of 
20 interest. 

17. Segregating DNA Homoloas For a V |t and V L that Form an 
Antigen Binding F v 

To segregate the individual clones containing DNA 
homologs that code for a V H and V L that form an antigen 

25 binding F v , an V H and V L expression library was titred 
according to Example 15. The titred expression library 
was then screened for the presence of the decapeptide tag 
expressed with the V H using the methods described in 
Example 16. DNA was then prepared from the clones to 

30 express the decapepide tag. This DNA was digested with 
the restriction endonuclease Pvn II to determine whether 



3 5 a V L DNA homolog. 



SUBSTITUTE SHEET 



WO 91/16427 



PCT/US91/02910 



113 



The clones containing both a V H and a V L DNA homolog 
were analyzed to determine whether these clones produced 
an assembled F v protein molecule from the V H and V L DNA 
homologs . 

5 The F v protein fragment produced in clones containing 

both V u and V. was visualized by immune precipitation of 
radiolabeled protein expressed in the clones. A 50 ml 
culture of LB broth (5 g/L yeast extract, 10 g/L and tryp- 
tone 10 g/L Nacl at pH 7.0) containing 100 ug/ul of 
10 ampicillin was inoculated with E. Coli harboring a plasmid 
contain a V H and a V L . The culture was maintained at 37 *C 
with shaking until the optical density measured at 550 nm 
was 0.5. The culture then was centrifuged at 3,000 g for 
10 minutes and resuspended in 50 ml of M9 media (6 g/L 
15 Na 2 HP0 4/ 3 g/L KH 2 P0 A , 0.5 g/L NaCi, 1 g/L NH 4 Cl, 2g/L 
glucose, 2 mM MgS0 4 and 0.1 mMgS0 A CaCl 2 supplemented with 
amino acids without methionine or cysteine. This solution 
was maintained at 37 *C for 5 minutes and then 0.5 mCi of 
35 S as HS0 4 (New England Nuclear, Boston, MA) was added and 
20 the solution was further maintained at #&C for an addi- 
tional 2 hours. The solution was then centrifuged at 300 
x g and the supernatant discarded. The resulting bacter- 
ial cell pellet was frozen and thawed and then resuspended 
for 10 minutes and the resulting pellet discarded. The 
25 supernatant was admixed with 10 ul of anti-decapeptide 
monoclonal antibody and maintained for 30-90 minutes on 
ice. 4 0 ul of protein G coupled to sepharose beads 
(Pharmacia, Piscataway, NJ) was admixed to the solution 
and the added solution maintained for 3 0 minutes on ice to 
allow an immune precipitate to form. The solution was 
centrifuged at 10,000 x g for 10 minutes and the resulting 



30 



35 resulting immune precipitate pellet was loaded onto a 
PhastGel Homogenous 20 gel (Pharmacia, Piscataway, NJ) 
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according to the manufacturer's directions. The gel was 
dried and used to expose X-ray film. 

The resulting autoradiogram is shown in Figure 12. 
The presence of V L that was immunoprecipitated because it 
5 was attached to the V H -decapepide tag recognized by the 
precipitating antibody. 

18. Generation of a Combinatorial Library Qt the 

Immunoglobulin R epertoire in Phacrs 

Vectors suitable for expression of V H , V L/ Fv and Fab 

10 sequences are diagrammed in Figures 7 and 9. As previ- 
ously discussed, the vectors were constructed by modifi- 
cation of Lambda Zap by inserting synthetic oligonucleo- 
tides into the multiple cloning site. The vectors were 
designed to be antisymmetric with respect to the Not I and 

15 EcoR I restriction sites which flank the cloning and 
expression sequences. As described below, this anti- 
symmetry in the placement of restriction sites in a linear 
vector like bacteriophage allows a library expressing 
light chains to be combined with one expressing heavy 

2 0 chains to construct combinatorial Fab expression 
libraries. Lambda Zap II V L II (Figure 9) is designed to 
serve as a cloning vector for light chain fragments and 
Lambda Zap II V H (Figure 7) is designed to serve as a 
cloning vector for heavy chain sequences in the initial 

25 step of library construction. These vectors are engi- 
neered to efficiently clone the products of PCR 
amplification with specific restriction sites incorporated 
at each end. 

A. PCR Amplification of Antib ody Fragments 

-) n T^n r»r-"P ^tnpl i fiVaf i^n n f *r,p*J? i q o 1 a t O H <F*-^n> -^l^o^ 

to clone and express heavy chain sequences including Fti 
and kappa chain sequences. The oligonucleotides primers 

Mqpd for these amplifications are presented in Tables 1 
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and 2 . The primers are analogous to those which have been 
successfully used in Example 6 for amplifications of V H 
sequences. The set of 5- primers for heavy chain ampli- 
fication were identical to those previously used to 
5 amplify V H and those for light chain amplification were 
chosen on similar principles, Sastry et al., proc. Natl. 
A^d. sci. USA , 8G: 5728 (1989) and Orland et al., Proc 
*n»*. fici. USA . 8G:3833 (1989). The unique 3' 
primers of heavy (igGl) and light (k) chain sequences, were 
10 chosen to include the cysteines involved in heavy-light 
chain disulfide bond formation. At this stage no primer 
was constructed to amplify lambda light chains since they 
constitute only a small fraction of murine antibodies. In 
addition, Fv fragments have been constructed using a 3' 
15 primer which is complementary to the to the mRNA in the J 
(joining) region (amino acid 128) and a set of unique 5' 
primers which are complementary to the first strand cDNA 
in the conserved N-terminal region of the processed 
protein. Restriction endonuclease recognition sequences 
20 are incorporated into the primers to allow for the cloning 
of the amplified fragment into a lambda phage vector in a 
predetermined reading frame for expression. 

B. T.ibrarv C onstruction 

The construction of a combinatorial library was 

25 accomplished in two steps. In the first step, separate 
heavy and light chain libraries were constructed in Lambda 
Zap II V H and Lambda Zap II V L II respectively. In the 
second step, these two libraries were combined at the 
antisymmetric EcoRl sites present in each vector. This 

30 resulted in a library of clones each of which potentially 
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parent animal. Lambda Zap II V H expression vector has been 
3 5 used to create a library of heavy chain sequences for DNA 

-rained by PCR amplifications of mRNA isolated from the 
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spleen of a 129 G 1x + mouse previously immunized with p- 
nitrophenyl phosphonamidate (NPN) antigen 1 according to 
formula I (Figure 13) conjugated to keyhole limpet 
hemocyanin (KLH) . 
5 The NPN-KLH conjugate was prepared by admixture of 

250 ul of a solution containing 2.5 mg of NPN according to 
formula 1 (Figure 12) in dimethyl formamide with 750 ul of 
a solution containing 2 mg of KLH in 0.01 M sodium phos- 
phate buffer (pH 7.2). The two solutions were admixed by 

10 slow addition of the NPN solution to the KLH solution 
while the KLH solution was being agitated by a rotating 
stirring bar. Thereafter the admixture was maintained at 
4°C for 1 hour with the same agitation to allow conjuga- 
tion to proceed. The conjugated NPN-KHL was isolated from 

15 the nonconjugated NPN and KLH by gel filtration through 
Sephadex G-25. The isolated NPN-KLH conjugate was used in 
mouse immunizations as described in Example 3. 

The spleen mRNA resulting from the above immuniza- 
tions was isolated and used to create a primary library of 

20 V H gene sequences using the Lambda Zap II V H expression 
vector. The primary library contains 1.3 x 10 6 pfu and has 
been screened for the expression of the decapeptide tag to 
determine the percentage of clones expressing Fd 
sequences. The sequence for this peptide is only in frame 

25 for expression following the cloning of an Fd (or V u ) 

H * 

fragment into the vector. At least 80% of the clones in 
the library express Fd fragments based on immuno-detection 
of the decapeptide tag. 

The light chain library was constructed in the same 
3 0 way as the heavy chain and shown to contain 2.5 x 10 6 
members. Plaque screening, using the anti-kappa chain 
antibody, indicated that 60% of the library contained 
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Once obtained, the two libraries were used to 
construct a combinatorial library by crossing them at the 
EcoRI site. To accomplish the cross, DNA was first 
purified from each library. The light chain library was 
5 cleaved with Mlul restriction endonuclease, the resulting 
5» ends dephosphorylated and the product digested with 
EcoRI. This process cleaved the left arm of the vector 
into several pieces but the light arm containing the light 
chain sequences, remained intact. In a parallel fashion, 

10 the DNA of heavy chain library was cleaved with Hindlll, 
dephosphorylated and cleaved with EcoR I, destroying the 
right arm but leaving the left arm containing the heavy 
chain sequences intact. The DNA's so prepared were then 
combined and ligated. After ligation only clones which 

15 resulted from combination of a right arm of light chain- 
containing clones reconstituted a viable phage. After 
ligation and packaging, 2.5 x 10 7 clones were obtained. 
This is the combinatorial Fab expression library that was 
screened to identify clones having affinity for NPN. To 

20 determine the frequency the phage clones which co-express 
the light and heavy chain fragments, duplicate lifts o the 
light chain, heavy chain and combinatorial libraries were 
screened as above for light and heavy chain expression. 
In this study of approximately 500 recombinant phage 

25 approximately 60% co-expressed light and heavy chain 
proteins. 

C. Antigen Binding 

All three libraries, the light chain, the heavy chain 
and Fab were screened to determine if they contained 
3 0 recombinant phage that expressed antibody fragments bind- 

nciirv: . P- . jupit 1 .. sDfcsiev. -vs.- iaur- 

Duplicate screens of 80,000 recombinant phage from the 
3 5 light chain library and a similar number from the heavy 

- n a : * - i t: r n : ■/ d i d n o r ; . den t i f y a n v ~ I oner, wh i c h b our. d t h • 
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antigen. In contrast, the screen of a similar number of 
clones from the Fab expression library identified many 
phage plaques that bound NPN (Figure 15) . This observa- 
tion indicates that under conditions where many heavy 
5 chains in combination with light chains bind to antigen 
the same heavy or light chains alone do not. Therefore, 
in the case of NPN, it is believed that there are many 
heavy and light chains that only bind antigen when they 
are combined with specific light and heavy chains 

10 respectively. 

To assess the ability to screen large numbers of 
clones and obtain a more quantitative estimate of the 
frequency of antigen binding clones in the combinatorial 
library, one million phage plaques were screened and 

15 approximately 100 clones which bound to antigen were 
identified. For six clones which were believed to bind 
NPN, a region of the plate containing the positive and 
approximately 20 surrounding bacteriophage plaques was 
"cored", replated, and screened with duplicate lifts 

20 (Figures 15). As expected, approximately one in twenty of 
the phage specifically bind to antigen. "Cores" of 
regions of the plated phage believed to be negative did 
not give positives on replating. 

To determine the specificity of the antigen-antibody 

2 5 interaction, antigen binding was competed with free 

unlabeled antigen as shown in Figure 16. Competition 
studies showed that individual clones could be distin- 
guished on the basis of antigen affinity- The concentra- 
tion of free hapten required for complete inhibition of 

3 0 binding varied between 10-100 x 10- 9 M suggestion that the 

expressed Fab fragments had binding constants in the 
nanomolar range . 



3 5 in preparation for characterization of the protein 

products able to bind NPN as described in Example 19C, a 
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plasmid containing the heavy and light chain genes was 
excised from the appropriate "cored" bacteriophage plaque 
using M13mp8 helper phage. Mapping of the excised plasmid 
demonstrated a restriction pattern consistent with incor- 
5 poration of heavy and light chain sequences. The protein 
products of one of the clones was analyzed by ELISA and 
Western blotting to establish the composition of the NPN 
binding protein- A bacterial supernate following IPTG 
induction was concentrated and subjected to gel fiitra- 

10 tion. Fractions in the molecular weight range 40-60 kD 
were pooled, concentrated and subjected to a further gel 
filtration separation. As illustrated in Figure 17, ELISA 
analysis of the eluting fractions demonstrated that NPN 
binding was associated with a protein of molecular weight 

15 about 50 kD which immunological detection showed contained 
both heavy and light chains. A Western blot (not shown) 
of a concentrated bacterial supernate preparation under 
non-reducing conditions was developed with anti- 
decapeptide antibody. This revealed a protein band of 

20 molecular weight of 50 kD. Taken together these results 
are consistent with NPN binding being a function of Fab 
fragments in which heavy and light chains are covalently 
1 inked . 

20. Flp recombinase-catalvzed Recombination 

2 5 Experiments directed to the in vivo recombination of 

two lambda vectors using flp recombinase-catalyzed recom- 
bination are described. The flp recombination site was 
introduced into the phage vectors using 39mer synthetic 
oligonucleotides. The sequence of the flp site utilized 

3 0 for recombination was derived from several references 



strategy. This was accomplished by making a point muta- 
35 tion which has little or no effect on its ability to allow 
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3367 (1985)). However, this point mutation is not 
required for the system to function. The oligonucleotides 
were further designed to be inserted in the EcoRl sites of 
the Lambda Zap II V H and Lambda Zap II V L vectors so that 
5 only one flanking EcoRl sites would be regenerated (see 
Figure 18) . The flanking sequences are not essential to 
the system. 

The following sequences were inserted into Lamba Zap II V. ri : 

(110) Oligo 79 EcoRl 
10 AATTCGAAGTTCCTATTCTCTAAAAAGTATAGGAACTTC 3 1 

( 111) Oligo 80 G CTT CAAG G AT AAG AG ATTTTT CAT AT C CTT G AAGTT AA 

5' 

The following sequences were inserted into Lambda Zap II 
15 (112) o 1 i g o 8 l 

AATTGAAGTTCCTATTCTCTAAAAAGTATAGGAACTTCG EcoRl 3' 
( 113 ) Oligo 82 CTTCAAGGATAAGAGATTTTTCATATCCTTGAAGCTTAA 

Vectors were constructed as follows. The first two 
20 oligonucleotides were mixed (0.5 fig oligo 79, 0.5 jig oligo 
80 , 1 /il 200 mM Tris f pH 7.4, 20 mM MgCl, 500 mM NaCl, and 
H 2 0 to 10 /xl) , heated to 85"C 5 min. , and allowed to cool 
to room temperature over 1 hour in a water bath. The 
procedure was repeated using oligos 81 and 82. 
25 Ligation into vector arms was accomplished by 

digesting Lambda Zap V H and Lambda Zap II V L with 3U/^g 
EcoRl according to standard digestion procedure. After 
phenol/chloroform extraction, DNA was precipitated with 
EtOH. The vector was not phosphatase treated so that the 
3 0 oligonucleotides could be inserted without kinase treat- 
ment, thus preventing multiple tandem ol iqonucleot ide 

. i^curuiny sianaaru ^icjaLion protocol ^ see Maniac is ei 
3 5 al . , supra ) . Ligation mixes were packaged using Gigapack 
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Gold™ (Stratagene Cloning Systems, San Diego, CA) 
according to the protocol recommended in the manual. 

Following packaging, the vectors were screened. 

TM 

Packaged DNA was plated according to the Gigapack Gold 
5 manual procedure on NZY agar with approximately 400 pfu 
per 100 mm Petri dish. Duplicate plaque lifts were done 
according to the protocol in the Predigested ZapII Cloning 
Manual (Stratagene Cloning Systems, San Diego, CA) on 
nitrocellulose filters. Denaturat ion and fixation of DNA 

10 onto the membranes is also described in the manual. 
Prehybridization was performed according to pBluescript II 
Exo/Mung DNA Sequencing System™ instruction manual 
(Stratagene Cloning Systems, San Diego, CA) for oligo- 
nucleotide probes (pg 6) . Hybridization was performed 

15 overnight using 32 P kinased oligo 79 (0.5 x 10 6 cpm/ml) 
according to the pBluescript manual (Stratagene Cloning 
Systems, San Diego, CA) . Oligo 79 was kinased using 
standard 32 P gamma ATP labelling techniques (see Maniatis 
et al., supra ) . Filters were washed in 6X SSC, 0.1% SDS, 

2 0 3 times at room temperature, once at 55 *C and finally at 
59 °C. Each was washed for approximately 10 minutes. 
Positive plaques were identified using X-ray autoradio- 
graphy. Twelve duplicate plaques were cored in 500 Ml SM, 
20 Ml chloroform. These plaques were sufficiently well 

25 isolated that secondary screening was not required. The 
cored plaques were excised according to the Predigested 
Zap II Cloning Manual (Stratagene Cloning Systems, San 
Diego, CA) and DNA from single ampicillin resistant 
colonies was sequenced using minipreped DNA and the T7 and 

30 T3 primers according to the DSK 3 5S Sequencing kit 
(Stratagene Cloning Systems, San Diego, CA) . Clones with 
flp sites in the correct orientation ^nd onnosite n^i^n- 

;.esi in vivo Hp- mediated recombination. 

In vivo f Ip-mediated recombination was accomplished 
as follows. Flp recombinase was expressed off the tac 
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promoter on a plasmid, pCS3, in E. coli MM294 strain 
(Lebreton et al. 1988) . This is a low copy number plasmid 
with the pACYC origin of replication and contains a 
chloramphenicol resistance gene. 
5 5 x 10 8 cells were coinfected with FLPHC and FLPLC 

vectors at an moi of 5 and 10 pfu each per cell. 
Combinations of FLPHC+ and FLPLV+, or FLPHC- and FLPLC- , 
or FLPHC+ and FLPLC- were tested. 

Overnight cultures of MM294 (pCS3 ) were 
10 spun down and resuspended in lOmM MgSO A at a density of 
OD 600 = 1. 0. The appropriate amounts of phage were added 
to 0.5 ml of cells and allowed to adhere at 37 *C for 15 
minutes. 50 ml of NZY was added to each flask and incu- 
bated for 2 hours with shaking. 250 jxl of chloroform was 
15 added to 25 ml of lysate and incubated for 15 minutes at 
room temperature. The supernatants were titred and 
screened for phage containing both Lambda Zap II V H left 
arms and Lambda Zap II V L right arms. Probes to identify 
Zap II v H left arms and Lambda Zap II V L right arms were 
20 designed by identifying unique sequences from the known 
sequence of the vectors. 

The Lambda Zap II V H left arm probe had the following 
sequence: 

(114) CTAGTTACCCGTACGACCCCCCCGTTCCGGACTACGCTTCTTAATAG 3 1 
25 This sequence hybridizes to the decapeptide sequence of 

the Lambda Zap II V H . The Lambda Zap II V L right arm probe 
had the following sequence: 

(115) 5 1 GAGCTCGTCAGTTCTAGAGTTAAGCGGCCG 3 1 

This sequence hybridizes to the sequence from the Sacl 
30 site to the former Notl site of the Lambda Zap II V L 
vector. 

The screening procedure used w^^ t-ho c??mn -> ^ *-h^*- 



^xLii oaSSL, x-sSDS j Limes at room temperature and twice at 
60 °C. Plaques which hybridized to both probes were 
identified by X-ray autoradiography, cored, excised and 
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digested to determine if recombination had occurred. 
Control plaques identified as hybridizing to only one 
probe and to neither probe were also cored. Diagnostic 
restriction digests were PvuII, Pvul, Xhol, Xhol/Pvul, 
5 Sacl, Sac/PVul f Notl, Xbal, Seal, Spel, Spel/Puul. 
Restriction digest results verified that recombination at 
the flp site occurred in vivo in cells expressing the flp 
recombinase gene and not in control SURE™ E. cpU cells 
(Stratagene Cloning Systems, San Diego, CA) which do not 
10 normally express flp recombinase. 

Efficiency of recombination according to the number 
of plaques identified as hybridizing to both probes was 
initially between about 5-10%. Changes to the protocol 
can be made, however, which will improve the efficiency of 
15 recovery of recombined vectors. For example, by adding 
selectable marker sequences to the left and right arms of 
the vectors, up to 100% of target recombinants can be 
identified (Figure 20) . Adding selection systems to 
ensure that all recombinants contain inserts will also 
20 increase the efficiency of identifying the desired clones. 

In Example 19 a relatively restricted library was 
prepared because only a limited number of primers were 
used for PGR amplification of Fd sequences. The library 
is expected to contain only clones expressing kappa/gamma 
25 sequences. However, this is not an inherent limitation of 
the method since additional primers can be added to 
amplify any antibody class or subclass. Despite this 
restriction we were able to isolate a large number of 
antigen binding clones. Of interest is how a phage 
library prepared as described herein compares with the in 
vivo antibody repertoire in terms of size, characteristics 



30 



- «■ \ 
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3 5 different antigen specificities is often quoted. With 
some of the reservations discussed below, a phage library 

- ; -i ■ -r larqe can readily :.: struct*-- ■) i-v 
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modification of the current method. In fact once an 
initial combinatorial library has been constructed, heavy 
and light chains can be shuffled to obtain libraries of 
exceptionally large numbers. 
5 In principle, the diversity characteristics of the 

naive (unimmunized) in vivo repertoire and corresponding 
phage library are expected to be similar in that both 
involve a random combination of heavy and light chains. 
However, different factors will act to restrict the 
10 diversity expressed by an in vivo repertoire and phage 
library. For example a physiological modification such as 
tolerance will restrict the expression of certain anti- 
genic specificities from the in vivo repertoire but these 
specificities may still appear in the phage library. For 
15 example, the representation of mRNA for sequences 
expressed by stimulated B-cells can be expected to 
predominate over those of unstimulated cells because of 
higher levels of expression. Different source tissues 
( e.g. , peripheral blood, bone marrow or regional lymph 
2 0 nodes) and different PCR primers (e.g. , ones expected to 
amplify different antibody classes) may result in library 
with different diversity characteristics. 

Another difference between in vivo repertoire and 
phage library is that antibodies isolated from the former 
2 5 may have benefited from affinity maturation due to somatic 
mutations after combination of heavy and light chains 
whereas the latter randomly combines the matured heavy and 
light chains. Given a large enough phage library derived 
from a particular in vivo repertoire, the original matured 
30 heavy and light chains will be recombined. However, since 
one of the potential benefits of this new technology is to 



3 5 for the absence of somatic mutation and clonal selection. 
Three procedures are made readily available through the 
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genesis may be performed on the CDR's and the resulting 
Fabs can be assayed for increased function- Second, a 
heavy or a light chain of a clone which binds antigen can 
be recombined with the entire light or heavy chain 
5 libraries respectively in a procedure identical to the one 
used to construct the combinatorial library. Third, iter- 
ative cycles of the two above procedures can be performed 
to further optimize the affinity or catalytic properties 
of the immunoglobulin. It should be noted that the latter 
10 two procedures are not permitted in B-cell clonal selec- 
tion which suggests that the methods described here may 
actually increase the ability to identify optimal 
sequences . 

Access is the third area where it is of interest to 
15 compare the in vivo antibody repertoire and phage library. 
In practical terms the phage library is much easier to 
access. The screening methods allow one to survey at 
least 50,000 clones per plate so that 10 6 antibodies can be 
readily examined in a day. This factor alone should 
20 encourage the replacement of hybridoma technology with the 
methods described here. The most powerful screening 
methods utilize selection which may be accomplished by 
incorporating selectable markers into the antigen such as 
leaving groups necessary for replication of auxotrophic 

2 5 bacterial strains or toxic substituents susceptible to 

catalytic inactivation. There are also further advantages 
related to the fact that the in vivo antibody repertoire 
can only be accessed via immunization which is a selection 
on the basis of binding affinity. The phage library is 

3 0 not similarly restricted. For example, the only general 

method to identify antibodies with catalytic properties 
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35 principle, be assayed directly. The ability to directly 
assay large numbers of antibodies for function may allow 
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not well defined or synthesis of the transition state 
analog is difficult. Assaying for catalysis directly 
eliminates the bias of the screening procedure for 
reaction mechanisms pejorative to a synthetic analog and 
5 therefore simultaneous exploration of multiple reaction 
pathways for a given chemical transformation are possible. 

Although we have given examples of several screening 
methods, it should be clear to one skilled in the art that 
alternative methods ot screening , usch as by panning dels 
10 or particles expressing the protein product on their sur- 
face would essentially be equivalent. If the expressed 
gene products of interest are RNA molecules instead of 
proteins, screening could be accomplished by nucleic acid 
hybridization or by detecting some functional property of 
15 th eRNA, such as ribozyme catalysis. 

The methods disclosed herein describe generation of 
Fab fragments which are clearly different in a number of 
important respects from intact (whole) antibodies. There 
is undoubtedly a loss of a affinity in having monovalent 
20 Fab antigen binders but this can be compensated by selec- 
tion of suitably tight binders. For a number of applica- 
tions such as diagnostics and biosensors it may be prefer- 
able to have monovalent Fab fragments . For applications 
requiring Fc effector functions, the technology already 
25 exists for extending the heavy chain gene and expressing 
the glycosylated whole antibody in mammalian cells. 

The ideas presented here address the bottle neck in 
the identification and evaluation of antibodies. It is 
now possible to construct and screen at least three orders 
30 of magnitude more clones with mono-specificity than previ- 
ously possible. The potential applications of the method 

Dicistronic DNA 

3 5 A method based on PCR amplification that fuses heavy 
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complete antigen binding domain of a Fab protein fragment 
composed of a heavy and a light chain. Schematic diagrams 
of an immunoglobulin molecule composed of heavy and light 
chains containing constant and variable regions is shown 
5 in Figure 1. Human heavy chain IgG and human kappa light 
chain are diagrammatically sketched in Figures 2A and 2B, 
respectively. To accomplish this procedure, immunoglobu- 
lin heavy and light chain primers were designed to produce 
a region of homology between two polymerase chain reaction 

■ 

10 (PCR) products. The complementary regions have been shown 
to hybridize predominantly under conditions where one set 
of primers ("inside primer pair") is used in a limiting 
amount relative to the other set of primers ("outside 
primer pair") . After the 3» ends of the PCR products have 

15 hybridized, the DNA polymerase has been shown to extend 
the ends creating a fusion sequence carrying the unique 
sequences of both PCR fragments separated by one copy of 
region X cistronic bridge. A two-step cloning procedure 
is thus avoided. When the recombinant sequence is then 

20 inserted into an expression vector such as ImmunoZAP, a 
fusion production capable of simultaneously expressing the 
heavy and light chains can be produced. 

The strategy used for producing immunoglobulin heavy 
and light chain PCR dicistronic DNA is shown schematically 

25 in Figure 21. Regions of the immunoglobulin heavy chain 
coding strand are designated V Hf C H 1, C H 2 , and C H 3 corres- 
ponding to functional regions in the protein. The corres- 
ponding regions of the non-coding strand are designated by 
a prime (■). Regions V L and C L are similarly labelled for 

3 0 the kappa light chain. This procedure can also be per- 
formed using lambda light chain specific regions. A 



35 inside primers. 

Overlapping oligonucleotide primers used in the 
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designed to encode the following: amino acids of 225 to 
230 of the IgG heavy chain hinge region which are common 
to all human IgG isotypes; an Spe I restriction site; two 
stop codons; a ribosome binding site; a periplasmic (pelB) 
5 leader sequence (Better, et al., Science . 240:1041-1043 
(1988); Lei f et al., J. Bacterid , , 169:4379-4383 (1988)); 
a Sac I restriction site which encodes amino acids 1 and 
2 of the mature kappa light chain; and amino acids 3 to 8 
of the mature kappa light chain. The X region was 
10 designed to contain a ribosome binding site and a pelB 
leader to ensure expression of the light chain. 
Nucleotide sequences for all human and mouse PCR primers, 
both inside and outside, are listed in Table 11. Primers 
followed by a prime ( 1 ) represent non-coding strand 
15 sequences . 

Table 11 

Human and Mouse PCR Primers 
Seq. 

Id. No. Human 

20 (117) V H 

(118) C H 1' 



(119) V L 

25 

(120) C L f 

Mouse 
30 (121) V H 



[12 3] V. 

35 



5 1 -GTCCTGTCCGAGGTGCAGCTGCTCGAGTCTGG- 3 1 
5 1 -AATAACAATCCAGCGGCTGCCGTAGGCAATAGGT 
ATTTCATTATGACTGTCTCCTTGCTATTAACTAG 
TACAAGATTTGGGCTC- 3 1 
5 1 -GCCTACGGCAGCCGCTGGATTGTTATTAATCGCT 
GCCCAACCTGCCATGGCTGAGCTCGTGATGACCC 
CAGTCTCC-3 1 
5 1 -TCCTTCTAGATTACTAACACTCTCCCCTGTTGAA 
GCTCTTTGTG ACGGGCGAACTC- 3 1 

5 1 -AGGTCCAGCTGCTCGAGTCTGG-3 1 

-j i A'i'ACAA ± vJ ^TGGGCACAA i - 
d 1 - GCCTACGGCAGCCGCTGGATTGTTATTAATCGC 
TGCCCAACCTGCCATGGCTGAGCTCGTGATGAC 
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( 12 4 ) C L 1 5 1 -TCCTTCTAGATTACTAACACTCTCCCCTGTTGAA-3 

The overlapping regions of the human C H 1» inside and 
v L inside primers are illustrated in Figure 22. The heavy 
5 chain downstream C H 1' inside primer sequence is written 3 1 
to 5 1 and the light chain upstream V L inside primer 
sequence is written 5' to 3'. The complementary PCR 
product strands, and not the primer strands, cross-prime 
to create the dicistronic molecule. bold nucleotides 

10 represent regions where the C H 1' inside primer hybridizes 
to the 3 • end of C H 1 on human IgG heavy chain mRNA or where 
the V L inside primer hybridizes to the 5' end of V L frame- 
work on human kappa light chain cDNA. The amino acid and 
nucleotides in italics represent changes in sequence from 

15 the original pelB leader sequence. 

At amino acid 15 of the pelB leader sequence, the 
codon was changed from CTC to ATC resulting in a conserv- 
ative amino acid change from a leucine to an isoleucine as 
shown in Figure 22 and Table 11. Hydrophobic amino acids 

20 in the core region of periplasmic leader sequences have 
been shown to be essential for correct processing of the 
leader sequence and transport of the mature protein to the 
periplasm. Oliver, in Neidhardt, R.C. (ed.), Escherichia 
coli and Salmonella Tvnhimurium. , Am. Soc. Microbiol., 

25 1:56-69 (1987) . The nucleotide changes were made to allow 
for the artifactual insertion of one or two dATPs at the 
3 ! end of the overlapping dicistronic molecules. Thermus 
aquaticus (Taq) DNA polymerase may add a dATP to the 3' 
end of the PCR product because of terminal transferase 

30 activity. Jiang, etg al. Oncogene , 4:923-928 (1989). The 
additional dATP would then cause a mismatch between the 



_ias web. ■ -t ^ - nereiort i.dnqt- 

35 dTTPs in this position of the oligonucleotide primers 

would allow proper base pairing if up to two dATPs were 
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The kappa light chain PCR product was designed to termi- 
nate at a position where two dTTPs occur 5' of the end of 
the product and did not require alterations of the nucleo- 
tide sequence. Nucleotides were changed in the kappa 
5 light chain primer encoding the pelB leader sequence with- 
out introducing amino acid changes in order to decrease 
the number of mismatches between the primer and the leader 
sequence of the kappa light chain mRNA as shown in Figure 

22 and Table II. 
10 All primers were synthesized on an Applied Biosystems 

DNA synthesizer, Model 381A, following the manufacturer's 
instructions . 

22. Preparation of a V n -and V L -Codina Repertoire 
A. Preparatio n of a v n -and V L -Codina RBPertoire from 3 
15 Human cDN A Combinatorial Library 

Cloned DNA, previously isolated from a combinatorial 
library that encodes human Fab fragments which bind 
tetanus toxoid (TT) was used as a template for preparing 
a V H -and V L -coding repertoire. Mullinax, et al- f supra . 
20 Briefly, the combinatorial library was prepared by the 
following approach. Volunteer donors, who had been pre- 
viously immunized against tetanus but had not received 
booster injections within the last year, received injec- 
tions on 2 consecutive days of 0.5 milliliters (ml) of 
25 alum-absorbed tetnus toxoid (TT) (40 microgram/ml (ug)/ml) 
(Connaught Laboratories, Swiftwater, Pennsylvania), 

One hundred ml of blood was drawn from the volunteers 
6 days post injection and anticoagulated with a mixture of 
0,14 M citric acid, 0.2 M trisodium citrate, and 0.22 M 
3 0 dextrose. The peripheral blood lymphocytes (PBLs) were 



degrees Celsius (25 U C). Isolated PBLs were washed twice 
35 with phosphate buffered saline (PBS) (150 mM sodium 

~Moridp anri 150 rr.M sodium phosphate, dH 7.? at 25 m C), 
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Total RNA was then purified from the PBLs (10 6 B cells 
per ml blood per 100 ml of blood) for an enriched source 
of B-cell mRNA coding for antiTT IgG using an RNA isola- 
tion kit according to manufacturer's instructions 
5 (Stratagene, La Jolla, California) and also described by 
Chomczynski et al. f Ap^l - Biochero. . 162:156-159 (1987). 
Briefly, the isolated PBLs were homogenized in 10 ml of a 
denaturing solution containing 4.0 M guanine isothiocyan- 
n ik m snHfim nifrato »t dH 7 . 0 .- and 0.1 M beta- 

V»* WW f ******* A A wr w w» v » — *T ' 

10 mercaptoethanol . One ml of sodium acetate at a concen- 
tration of 2 M at pH 4.0 was admixed with the homogenized 
cells. Ten ml of phenol that had been previously satur- 
ated with H 2 0 was also admixed to the denaturing solution 
containing the homogenized cells. Two ml of a chloroform: 

15 isoamyl alcohol (24:1 v/v) mixture was added to this homo- 
genate. The homogenate was mixed vigorously for ten 
seconds and maintained on ice for 15 minutes. The homo- 
genate was then transferred to a thick-walled 50 ml 
polypropylene centrifuged tube (Fisher Scientific Company, 

20 Pittsburgh, Pennsylvania). The solution was centrifuged 
at 10,000 x g for 20 minutes at 4'C. The upper RNA- 
containing aqueous layer was transferred to a fresh 50 ml 
polypropylene centrifuge tube and mixed with an equal 
volume of isopropyl alcohol. This solution was maintained 

25 at -20 °C for at least one hour to precipitate the RNA. 
The solution containing the precipitated RNA was centri- 
fuged at 10,000 x g for twenty minutes at 4*C. The 
pelleted total cellular RNA was collected and dissolved in 
3 ml of the denaturing solution described above. Three ml 

3 0 of isopropyl alcohol was added to the re-suspended total 
cellular RNA and inverted to mix. This solution was main- 
f.inpH -oot fnr *t- ipRst 1 hour to nrecipitate the 



pelleted RNA was wasnea once with a solution containing 
75% ethanol. The pelleted RNA was dried under vacuum for 
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15 minutes and then re-suspended in diethyl pyrocarbonate 
(DEPC) treated (DEPC-H 2 0) 1^0) . 

Messenger RNA (mRNA) was prepared from the total 
cellular RNA using methods described in Molecula r Cloning 
5 A Laboratory Manual , Maniatis et al., eds., Cold Spring 
Harbor, NY, (1982). Briefly, 500 mg of the total RNA 
isolated from a PBLs prepared as described above was re- 
suspended in one ml of IX sample buffer (1 mM Tris-HCl, 
(Tris [ hydroxy lnethyl-aninamethane ]) pK 7.5; 0.1 mM EDTA 
10 (disodium ethylene diamine tetra-acetic acid) , 0.5 M NaCl) 
and maintained at 65 *C for five minutes and then on ice 
for five more minutes. The mixture was then applied to an 
oligo-dT (Stratagene) column that was previously prepared 
by washing the oligo-dT with a solution containing 10 mM 
15 Tris-HCl, pH 7.5; 1 mM EDTA, 0.5 M NaCl. The eluate was 
collected in a sterile polypropylene tube and reapplied to 
the same column after heating the eluate for five minutes 
at 65 °C. The oligo dT column was then washed with 0.4 ml 
of high salt loading buffer consisting of 10 mM Tris-HCl 
20 at pH 7.5, 500 mM sodium chloride, and 1 mM EDTA. The 
oligo dT column was then washed with 2 ml of 1 X low salt 
buffer consisting of 10 mM Tris-HCl at pH 7.5, 100 mM 
sodium chloride, and 1 mM EDTA. The messenger RNA was 
eluted from the oligo dT column with 0.6 ml of buffer 
25 consisting of 10 mM Tris-HCl at pH 7.5, and ImM EDTA. The 
messenger RNA was purified by extracting this solution 
with phenol/chloroform followed by a single extraction 
with 100% chloroform. The messenger RNA was concentrated 
by ethanol precipitation and re-suspended in DEPC H 2 0. 
3 0 The messenger RNA isolated by the above process 

contains a plurality of different V H and V L coding poly- 



,,-un reaction with a nrst-strand synthesis Kit according 
to manufacturer's instructions (Stratagene) by using an 
oligo (dT) primer for the light chain and specific 
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10 



30 



primer, C H 1» , for the heavy chain. Mullinax et al., su^ra. 
In a typical 50 /a1 transcription reaction, 5 ug of PBL 
mRNA in water was first hybridized (annealed) with 200 ng 
(50.0 pmol) of an oligo (dT) primer for the light chain. 
In a separate reaction, 5 ug of PBL mRNA in water was 
first hybridized (annealed with 200 ng (20 pmol) of the 
heavy chain primer, C H 1\ at 65'C for five minutes. 
Subsequently, the mixture was adjusted to 0.5 mM each of 
dATP, dCTP, dGTP and dTTP , 50 mM Tris-HCl at pK 8.3, 3 xnM 
MgCl 2 75 mM KC1, 10 mM DTT, 20 units of RNase block II 
(Stratagene) , and 20 units of Moloney-Murine Leukemia 
virus reverse transcriptase (Stratagene Cloning Systems), 
was added and the solution was maintained for 1 hour at 
37 -C. PCR amplification of the heavy and light chain 
15 sequences was done separately using 0.25-0.5 ug of first- 
strand synthesis product as template with sets of primer 
pairs using Taq DNA polymerase as described in Example 23. 

The PCR amplified light chain DNA fragments were then 
digested with Sac I and Xba I and ligated into a modified 
20 Lambda Zap II vector as prepared in Example 29 to form a 
light chain ImmunoZap Library (ImmunoZAP L; Stratacyte, La 
Jolla, California) . The PCR amplified heavy chain DNA was 
digested with Spe I and Sho I and ligated into a different 
modified Lambda Zap II vector as prepared in Example 27 to 
2 5 form a heavy chain ImmunoZap Library (ImmunoZAP H; 
Stratacyte) . The resulting libraries were amplified and 
the resulting DNA was packaged into bacteriophage with in 
vitro packaging extract, Gigapack II gold (Stratagene) and 
used to infect E. coli strain XLl-Blue (Stratagene) . 

To construct a library for coexpression, the right 
art of the heavy chain library phage DNA was digested with 



4. \ 



Jb right arm of ImmunoZAP with kappa light chain inserts. 
Both products were then digested with EcoRI and ligated 

■ ■ r^t.r . t combinatorial library -hat -ncodo^ human Fab 
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fragments including those specific for TT. Mullinax, et 
al. , supra . 

Reactive plaques were first identified by binding to 
tetanus toxoid as described in Example 31- Bacteriophage 
5 from purified reactive plaques were then converted to the 
plasmid format by in vivo excision with R408 helper phage 
(Stratagene) following methods described in Example 31 and 
familiar to one skilled in the art. Short, et al., Nucl. 
Acids. Res. . 16:7583-7600 (19SS) . The resulting purified 
10 plasmid DNA encoding heavy and light chain was then used 
in PCR reactions as described below in Example 23. 

B. Preparation of a V, r and V L -Codina Repe rtoire from 

mRNA from Tissues and Cells 
(i) Human 

15 Purified populations of PBLs, other lymphocytes, and 

hybridomas which express immunoglobulins including IgG, 
IgM, IgE, IgD, and IgA are used as sources for isolating 
mRNA encoding immunoglobulins. PBL's and other immuno- 
globulin expressing lymphocytes are isolated from either 

20 spleen, lymphoid tissue or plasma. Following purification 
of the cells, total RNA is then purified from these cells 
using a RNA isolation kit (Stratagene) as described in 
Example 22a. The purified RNA is then converted to cDNA 
with a first-strand synthesis kit as described in Example 

2 5 22a. The resultant cDNA is then used as a template in PCR 

amplification reactions as described below in Example 23 
for the production of dicistronic molecules expressing 
heavy and light chains. 

(ii) Mouse 

10 Populations of cells described ahnvp ^an Hn ; cni^ P ^ 

• ,. ... ,.. * ... U. ... ...... i. 

loove ana useu as templates in PCR amplification reac- 
tions. cDNA synthesized from mRNA isolated from a mouse 

3 5 anti-human fibronectin hybridoma (ATCC, CRL-1606) was used 
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10 



20 



as a preferred template for the production of dicistronic 
molecules expressing heavy and light chain. 



25 



c. 



Pre paration of a V H= Cgdjpg Repertoi 



DNA 

Rearranged DNA isolated from PBLs, other lymphocytes, 
and hybridomas which express immunoglobulins can be used 
to prepare a V H -coding repertoire. The amplification 

j tt — yanorl-riirp 11 =1 1 T\<3 

procedure tor preparmy a --«- 

rearranged DNA is performed as described in Example 23. 



23. PrP paratiQ " *f DNA HomolpqS 
A . v -rndina nrmble St ^nded DNA Homologs 

Cloned DNA, prepared in Example 22 from a combina- 
torial library that encodes human Fab fragments which bind 
tetanus toxoid (TT) , was used as a template for preparing 
15 a V H -coding double stranded DNA homolog. Human heavy 
chain, containing both the V H and C H 1 coding region and 
designated as Fd, was amplified in a PCR reaction. THe 
amplification was performed in a 100 ul reaction contain- 
ing 5 nanograms (ng) of the cloned DNA in PCR buffer 
consisting of the following: 10 mM Tris-HCl, pH 8.3; 50 
mM KC1, 1.5 mM MgCl 2 ; 0.001% (w/v) gelatin; 200 mM of each 
dNTP; 200 nanomolar (nM) of each primer; and 2.5 units of 
Taq DNA polymerase. The human V H outside primer and C H 1' 
inside primer were used as a PCR primer pair for amplifi- 
cation of the heavy chain (Table 11 and Figure 21) . The 
reaction mixture was overlaid with mineral oil and sub- 
jected to 40 cycles of amplification. Each amplification 
cycle (thermocycle) involved denaturation at 94 'C for 1.5 
minutes, annealing at 54 -C for 2.5 minutes and poly- 



i V ' '■ * - ' ™ >J 



homolog containing samples were then gex puntiea. 
extracted twice with phenol/chloroform, once with chloro- 
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form followed by ethanol precipitation and were stored at 
-70 °C in 10 mM Tris-HCl, pH 7.5, and 1 mM EDTA. 

To verify the amplification of the heavy chain, the 
PGR purified products were electrophoresed in an agarose 
5 gel. The expected size of the heavy chain was approxi- 
mately 730 base pairs as shown in Figure 23. The V H - 
coding double stranded DNA homologs were then used in 
subsequence PCR amplification reactions with V L -coding 
counterparts prepared below for the production of dicis- 
10 tronic DNA molecules having V H and V L cistronic portions as 
illustrated in Example 24. 

B. V L -Codin q Double Stranded DNA Homologs 

Cloned DNA, prepared in Example 22 from a combina- 
torial library that encodes human Fab fragments which bind 

15 tetanus toxoid (TT) , was used as a template for preparing 
a V L ~coding double stranded DNA homolog. Human light 
chain, containing the entire coding region of kappa light 
chain (V L and C L ) , was amplified using the same PCR condi- 
tions described for human heavy chain with the exception 

20 that a human V L inside primer and C L ' outside primer were 
used as the PCR primer pair (Table 11 and Figure 21) . The 
resultant V L -coding double stranded DNA homolog was gel 
purified and stored as described above. 

To verify the amplification of the light chain, the 

2 5 PCR purified products were electrophoresed in an agarose 
gel. The expected size of the light chain was approxi- 
mately 690 base pairs as shown in Figure 23. The VL- 
coding double stranded DNA homologs were then used in 
subsequent PCR amplification reactions with V H -coding 

30 counterparts prepared above for the production of 
dicistronic DNA molemlp^ innc^-r3f D H ^ p V iwin -> • 
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24. pre paration nf TntemaU v-Primed Duplexes of V„- and 
^ -Coding DNA Homolog 

A. H yhridlzati ™ ^ V„- MlQ] V, -Coding DNA Homoloqs 

The V - and V L -coding double stranded DNA homologs 
5 prepare in Examples 23A and 23B, respectively, were 
admixed together and denatured at 95 «C for 5 minutes to 
separate the strands of each homolog. The denatured V H - 
and V L -coding DNA strands in the admixture were then 
annealed at 54 *C for 5 minutes to form a V„= and V L ceding 

10 duplex DNA molecule hybridized at the 3' ends at region X 
of each original homolog. One strand of the X region 
(cistronic) bridge encodes at least one stop codon in the 
same reading frame as the upstream cistron, a ribosome 
binding site downstream from the stop codon, and a 

15 polypeptide leader (pelB) having a translation initiation 
codon in the same reading frame as the downstream cistron 
located downstream from the ribosome binding site. 

B . Primer Extension fcg Produce Dicistronic DNA Molecules 
The hybridized recombinant V H - and V L -coding DNA 

20 molecule (internally primed duplex) was subjected to 
primer extension and then amplified with only the V H and 
C • primers following the PCR reaction procedure described 
in Example 23A. This second PCR reaction is schematically 
represented in Figure 21. The PCR reaction products were 

25 gel electrophoresed to verify the presence of the result- 
ant V- and V, -coding dicistronic DNA molecules. The 
expected size of the dicistronic molecule was about 1390 
base pairs and is shown in Figure 23. The resultant V H - 
and V L -coding dicistronic DNA molecules were then ligated 

30 into the modified ImmunoZAP H vector (Figures 24A and 24B) 
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2. Preparation of Mou se Hvbridoma V, r and V r Codinq 
Double Stranded DNA Homoloas and Production — o£ 
Dicistron i, c DNA Molecules in a Single Amplification 
Reaction 

5 Mouse hybridoma heavy and light chain cDNA prepared 

in Example 22B was amplified in a single PCR reaction 
using the reaction conditions given above with an excess 
of the outside primers (200 nM concentration of both the 
mouse V H primer and C L ! primer) and a limiting amount of 
10 the inside primers (20 nM concentration of both the mouse 
C u l" and V. primer) (Table 11). The resultant mouse heavy 
and light chain dicistronic molecules were then inserted 
into a modified ImmunoZAP H for construction of an 
expression vector as described in Example 30. 

15 26. Preparation of In ternal Iv-Primed Duplexes Using a 
Single Internal Primer that Overlaps Both the V„ and 
v L Repertoires 

Another approach to producing a library of dicis- 
tronic DNA molecules is to use a single internal primer 
2 0 instead of using two separately internal primers. The 
process of creating a dicistronic molecule comprising an 
upstream V H cistron and a downstream V L cistron is to 
combine in a PCR buffer the following: a repertoire of V H 
genes consisting of at least 10 5 different genes; a reper- 

25 toire of V L genes consisting of at least 10 4 different 
genes; an outside V H primer; an outside V L ; and a poly- 
nucleotide strand having a 3* -terminal priming portion, a 
cistronic bridge coding portion, and a 5' terminal primer- 
template portion. The PCR reaction is performed as 

30 described in Example 22A. 



outside primers. The d ' -terminal priming portion encodes 
3 5 a nucleotide base sequence homologous to a portion of the 

~>--; r p-r pyt-onqiop product nf the other ontnide rriner 
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cistronic bridge coding portion encodes at least one stop 
codon in the same reading frame as the upstream cistron, 
a ribosome binding site downstream from the stop codon and 
a polypeptide leader (pelB) having a translation initia- 
5 tion codon in the same reading frame as the downstream 
cistron where the initiation codon is located downstream 
from the ribosome binding site. Polynucleotide strand 
(linker) primers useful in this invention are listed in 
Table 12. 

10 Table 12 

Polynucleotide Strand fLinlcer^ Primers 

Seg. 
Id. No. 

(1251) 1 1' 5' GGAGAGTGGGTCATCACGAGCTCAGCCATGGCAGGTTGG 
15 GCAGCGATTAATAACAATCCAGCGGCTGCCGTAGGCAAT 

AGGTATTTCATTATGACTGTCTCCTTGCTATTAACTAGT 

ACAAGATTTGGGCTC 3' 
(126) 2 2' 5' GAGCCCAAATCTTGTACTAGTTAATAGCAAGGAGACAGT 

CATAATGAAATACCTATTGCCTACGGCAGCCGCTGGATT 
2 0 GTTATTAATCGCTGCCCAACCTGCCATGGCTGAGCTCGT 

GATGACCCACTCTCC 3 ' 

1 Primes mRNA (sense strand) of heavy chain C H 1 
region; antisense strand of light chain V L with dicistronic 
bridge in between heavy and light chains will be in the 

2 5 same relative orientation as given in the example. 

2 Primes antisense strand of heavy chain C H 1 regicris; 
and sense strand of light chain V L region with dicistronic 
in between heavy and light chains will be in the same 
relative orientation as given in the example. 

30 The resultant single step internally primed 

, * ■> • * ■ 1 



^sc r i n&i.x xampi «... 
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27 . Preparation of Lambda Zap II Expression Vector 

The vector Lambda Zap™ II (Stratagene) is a 
derivative of the original Lambda Zap (ATCC # 40,298) that 
maintains all of the characteristics of the original 
5 Lambda Zap including 6 unique cloning sites, fusion 
protein expression, and the .ability to rapidly excise the 
insert in the form of a phagemid (Bluscript SK-) , but 
lacks the SAM 100 mutation, allowing growth on many Non- 
Sup F strains, including XL1— Blue* The Lambda Zap II was 

10 constructed as described in Short et al., Nucleic Acids 
Res. , 16:7583-7600, (1988) , by replacing the Lambda S gene 
contained in a 4254 base pair (bp) DNA fragment produced 
by digesting Lambda Zap with the restriction enzyme Ncol. 
This 4254 bp DNA fragment was replaced with the 4254 bp 

15 DNA fragment containing the Lambda S gene isolated from 
Lambda gtio (ATCC # 40,179) after digesting the vector 
with the restriction enzyme Ncol. The 4254 bp DNA frag- 
ment isolated from lambda gtlO was ligated into the 
original Lambda Zap vector using T4 DNA ligase and 

20 standard protocols for such procedures described in 
Current Protocols in Molecular Biology . Ausubel et al., 
eds., John Wiley and Sons, NY, 1987. 



28 . Preparation of V n -Expression Vectors, ImmunoZAP H and 
Modified ImmunoZAP H. Construction 
25 A. ImmunoZAP H 

The main criterion used in choosing a vector system 
was the necessity of generating the largest number of Fab 
fragments which could be screened directly. Bacteriophage 
lambda was selected as the expression vector for three 
3 0 reasons. First, in vitro packaging of phage DNA is the 



■ , _fc_ A. -..X >- ■ ■* ■ ,. *. ^.A A . 

screening or phage libraries typically involve less diffi- 
35 culty with nonspecific binding. The alternative, plasmid 

-lonina vpc f or? aro ~r 1 v advantaaeous ir tho irilvsir 
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clones after they have been identified. This advantage is 
not lost in the present system because of the use of 
lambda Zap, thereby permitting a plasmid containing the 
heavy chain, light chain, or Fab expressing inserts to be 
excised. 

To express the plurality of V H -coding DNA homologs in 
an E. coli host cell, a vector was constructed that placed 
the V H -coding DNA homologs in the proper reading frame, 

as described by Shine et 



provided a ribcsoine Em^n^ 



10 al., Nature , 254:34, (1975), provided a leader sequence 
directing the expressed protein to the periplasmic space, 
provided a polynucleotide sequence that coded for a known 
epitope (epitope tag) and also provided a polynucleotide 
that coded for a spacer protein between the V H -coding DNA 
15 homolog and the polynucleotide coding for the epitope tag. 
A synthetic DNA sequence containing all of the above 
polynucleotides and features was constructed by designing 
single stranded polynucleotide segments of 20-4 0 bases 
that would hybridize to each other and form the double 
20 stranded synthetic DNA sequence shown in Figure 25A. The 
individual single-stranded polynucleotides (N^N^) are 
shown in Table 13 below. 





Table 3,3 








Seq. 








25 


Id. No. 








(91) 


Nl) 


5 1 


GGCCGCAAATTCTATTTCAAGGAGACAGTCAT 3 1 




(92) 


N2) 


5 1 


AATGAAATACCTATTGCCTACGGCAGCCGCTGGATT 3 1 




(93) 


N3) 


5 f 


GTTATTACTCGCTGCCCAACCAGCCATGGCCC 3 1 




(94) 


N4) 


5 1 


AGGTGAAACTGCTCGAGAATTCTAGACTAGGTTAATAG 3 1 


30 


(95) 


N5) 


5 1 


TCGACTATTAACTAGTCTAGAATTCTCGAG 3 ■ 








cr f 








N9-4 


) 1 


, 1 AGGTGAAACTGCTCGAGATTTCTAGACTAGTTACCCGTAC 


35 


(100) 


Nil) 


5 


' GACGTTCCGGACTACGGTTCTTAATAGAATTCG 3 ' 




t 1 o i > 


Nl 7 ^ 




T TCGACGAATTCTATTAAGAACCGTAGTC 1 ' 
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(102) N10-5) 5 1 CGGAACGTCGTACGGGTAACTAGTCTAGAAATCTCGAG 3' 
Polynucleotide N2, N3, N9-4 ' , Nil, N10-5', N6, N7 and 
N8 were kinased by adding 1 /il of each polynucleotide (0.1 
ug/^il) and 20 units of T 4 polynucleotide kinase to a solu- 
5 tion containing 70 mM Tris-HCl at pH 7.6, 10 mM MgCl 2 , 5 mM 
DTT, 10 mM beta mercaptoethanol , 500 ug/ml of BSA. The 
solution was maintained at 37 *C for 30 minutes and the 
reaction stopped by maintaining the solution at 65 *C for 
10 minutes. The two end polynucleotides, 20 ng, of poly- 
10 nucleotides Nl and polynucleotides N12, were added to the 
above kinasing reaction solution together with 1/10 volume 
of a solution containing 20 mM Tris-HCl, pH 7.4, 2 mM MgCl 2 
and 50 mM NaCl. This solution was heated to 70 °C for 5 
minutes and allowed to cool to room temperature, approxi- 
15 mately 2 5 °C, over 1.5 hours in a 500 ml beaker of water. 
During this time period all 10 polynucleotides annealed to 
form the double stranded synthetic DNA insert shown in 
Figure 25A. The individual polynucleotides were 

covalently linked to each other to stabilize the synthetic 
20 DNA insert by adding 40 fil of the above reaction to a 
solution containing 50 mM Tris-HCl, pH 7.5, 7 mM MgCl 2 , 
1 mM DTT, 1 mM ATP and 10 units of T4 DNA ligase. This 
solution was maintained at 37 'C for 3 0 minutes and then 
the T4 DNA ligase was inactivated by maintaining the 
25 solution at 65 'C for 10 minutes. The end polynucleotides 
were kinased by mixing 52 ^1 of the above reaction, 4 jxl 
of a solution containing 10 mM ATP and 5 units of T4 
polynucleotide kinase. This solution was maintained at 
37 °C for 3 0 minutes and then the T4 polynucleotide kinase 
30 was inactivated by maintaining the solution at 65 'C for 10 
minutes . 



enzymes Not: ana Xhoi. The ligation mixture was packaged 
according to the manufacturer's instructions using 

niaanack TT Gold packina extract rsrrataaene 1 * nark- 
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aged ligation mixture was plated on XLl-blue cells 
(Stratagene) . Individual Lambda Zap II plaques were cored 
and the inserted excised according to the in vivo excision 
protocol provided by the manufacturer (Stratagene) . This 
in vivo excision protocol converts the cloned insert from 
the Lambda Zap II vector into a plasmid vector to allow 
easy manipulation and sequencing. The accuracy of the 
above cloning steps was confirmed by sequencing the insert 
using the Sanger dideoxy method described in by Sanger et 

al. f Prnn- Natl. *r«ri , Sci. USA , 74:5463-5467, (1977) and 

using the manufacturer's instructions in the AMV Reverse 
Transcriptase 35 S-ATP sequencing kit (Stratagene) . The 
sequence of the resulting V H expression vector is shown in 
Figure 25A and Figure 26. 



15 B. Modified TmmunoZAP H 

To create a fusion-PCR library from hybridoma RNA for 
expressing the plurality of V H -coding DNA homologs in an 
coli host cell, a vector based on the ImmunoZAP H vector 
described above was constructed. The procedure for con- 

2 0 struct ing the vector was performed as described above with 
the following modifications: elimination of the SacI site 
between the T 3 polymerase and NotI sites and changing the 
nucleotide base residue sequence from AAA to CAG which 
resulted in an amino acid residue change from lysine to 

2 5 glutamine as shown in Figures 24A and 24B. 

The individual single-stranded polynucleotides (N,, 
N , N 6 and N 7 ) , which were modified from their counterparts 
listed in Table 14, are listed in Table 14 below. 



Table 14 



v !'(;t:u'o^:i::ii aaa ; v 1 :A ; T' l^^uuhunurtu i - a 

(128) N2) 5' AATGAAATACCTATTGCCTACGGCAGCCGCTGGATT 3 

(129) N3) 5' GTTATTACTCGCTGCCCAACCAGCCATGGCCC 3' 

: \ G G TG C AG C 7 G GT CG AG A ATT CT A G A CT A G GTT A A T * 
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(131) N5) 5' TCGACTATTAACTAGTCTAGAATTCTCGAG 3» 

(132) N6) 5 1 CAGCTGCACCTGGGCCATGGCTGGTTGGG 3' 

(133) N7) 5' CAGCGAGTAATAACAATCCAGCGGCTGCCGTAGGCAATAG 3' 

(134) N8) 5 1 CTATTTCATTATGACTGTCTCCTTGAAATAGAATTTGCGGCCGC 
5 3 f 

(135) N9-4) 5' AGGTGAAACTGCTCGAGATITCTAGACTAGTTACCCGTAC 3 1 

(136) Nil) 5 1 GACGTTCCGGACTACGGTTCTTAATAGAATTCG 3' 

(137) N12) 5 1 TCGACGAATTCTATTAAGAACCGTAGTC 3 1 

( 13 8 ) N10-5 ^ 5 1 nGGAACGTCGTACGGGTAACTAGTCTAGAAATCTCGAG 3 1 

10 The modified ImmunoZAP H vector was created to 

eliminate an unnecessary SacI site in the ImmunoZAP H 
vector, (Example 28a, when the heavy and light chain 
vectors were combined. The modifications also improved 
the efficiency of secretion of positively changed amino 

15 acids in the amino terminus of the expressed protein. 
Inouye et al., Proc. Natl. Acad. Sci. USA . 85:7685-7689 
(1988) . 

29. Preparation of V L Expression Vector ImmunoZAP L 
Construction 

20 To express the plurality of V L coding polynucleotides 

in an E. coli host cell, a vector was constructed that 
placed the V L coding polynucleotide in the proper reading 
frame, provided a ribosome binding site as described by 
Shine et al., Nature, 254:34, (1975), provided a leader 

25 sequence directing the expressed protein to the peri- 
plasmic space and also provided a polynucleotide that 
coded for a spacer protein between the V L polynucleotide. 
A synthetic DNA sequence containing all of the above 
polynucleotides and features was constructed by designing 

30 single stranded polynucleotide segments of 20-40 bases 
that would hybridize to each other and fo*rm the double 



3 5 Polynucleotides N2 , 

kinased by adding 1 ul of 



N3, N4, N6 f N7 and N8 were 
each polynucleotide and 20 i:nits 
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of T 4 polynucleotide kinase to a solution containing 70 mM 
Tris-HCl, pH 7.6, 10 mM MgCl 2 , 5 mM DDT , 10 mM 2ME, 500 
micrograms per ml of BSA. The solution was maintained at 
37 *C for 30 minutes and the reaction stopped by maintain- 
5 ing the solution at 65 'C for 10 minutes. The two end 
polynucleotides 20 ng of polynucleotides Nl and poly- 
nucleotides N5 were added to the above kinasing reaction 
solution together with 1/10 volume of a solution contain- 
ing 20 mM Tris-HCi, pH 7.4, 2 mM MgCi 2 and 50 mM NaCi. 
10 This solution was heated to 70 'C for 5 minutes and allowed 
to cool to room temperature, approximately 25 °C, over 1.5 
hours in a 500 ml beaker of water. During this time 
period all of the polynucleotides annealed to form the 
double stranded synthetic DNA insert. The individual 
15 polynucleotides were covalently linked to each other to 
stabilize the synthetic DNA insert with adding 40 Ml of 
the above reaction to a solution containing 50 Ml Tris- 
HCl, pH 7.5, 7 mM MgCl 2 1 mM DTT, 1 mM ATP and 10 units of 
T4 DNA ligase. This solution was maintained at 37 *C for 
20 30 minutes and then the T4 DNA ligase was inactivated by 
maintaining the solution at 65 *C for 10 minutes. The end 
polynucleotides were kinased by mixing 52 Ml of the above 
reaction, 4 (il of a solution recontaining 10 mM ATP and 5 
units of T4 polynucleotide kinase. This solution was 
25 maintained at 37 'C for 30 minutes and then the T4 poly- 
nucleotide kinase was inactivated by maintaining the 
solution at 65 'C for 10 minutes. 

The completed synthetic DNA insert was ligated 
directly into a Lambda Zap II vector prepared in Example 
27 that had been previously digested with the restriction 
enzymes NotI and Xhol. The ligation mixture was packaged 

I • i iv ....... c , . 1 ■ ~ * * ' *" 



Example Z8A. individual lambda Zap 11 plaques were cored 
and the inserts excised according to the in vovo excision 

in .-u r .„ : Exar.rl- ""A Thir in vivo 
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excision protocol converts the cloned insert from the 
Lambda Zap II vector into a phagemid vector to allow easy 
manipulation and sequencing and also produces the phagemid 
version of the V L expression vectors. The accuracy of the 
5 above cloning steps was confirmed by sequencing the insert 
using the Sanger dideoxy method described by Sanger et 
al., Proc. Natl. Acad. Sci. USA . 74 : 5463-5467 , (1977) and 
using the manuf acturer 1 s instructions in the AMV reverse 
transcriptase 35 S-dATP sequencing kit (Stratagene) ; The 

10 sequence of the resultin V L expression vector is shown in 
Figure 25B and Figure 27) . 

The V L expression vector used to construct the V L 
library was the phagemid produced to allow the DNA of the 
V L expression vector to be determined. The phagemid was 

15 produced, as detailed above, by the in vivo excision pro- 
cess from the Lambda Zap V L expression vector (Figure 27) . 

30. Construction of V | |L -Expression Vectors and Library 
A. Ligation of Dicistronic DNA Molecules with Modified 
ImmunoZAP H 

20 In preparation for cloning a library enriched in V H - 

v L -coding (V HL ) dicistronic DNA molecules, PCR amplified 
products (human or mouse) prepared in Examples 24, 25, and 
26 (50 mM NaCl, 25 mM Tris-HCl, pH 7.7, 10 mM MgCl 2 , 10 mM 
0-mercaptoethanol, 100 ug/ml BSA, at 37 *C were digested 

25 with restriction enzymes Xhol and Xbal at a concentration 
of 60 units of enzyme per ug of DNA, and purified on a 1% 
agarose gel. After gel electrophoresis of the digested 
PCR simplified dicistronic DNA molecules, the region of the 
gel containing the DNA fragments of approximately 13 60 

30 base pairs in size was excised, purified using Gene-Clean 
(BIO 101, La Jolla, California) , ethanol orecipi tated and 

r« .Lit:** : . ^ a ieu *jvei li-Lyiii. ^ uu ± uy ui 

3 5 modified ImmunoZAP H vector, prepared in Example 2 8b, 
(Stratagene) previously digested with Xhol and Xbal. A 
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portion of the ligation mixture (1 ul) was packaged for 2 
hours at room temperature using Gigapack Gold packaging 
extract (Stratagene) and the packaged material was plated 
on a permissive E. coli (strain XLl-blue) lawn to generate 
5 plaques. The library was determined to consist of predom- 
inantly V HL with less than 5% non-recombinant background. 

B. Screening of Antibo dy-Producing Plaques 
(i) Human 

To screen for expression of V HL dicistronic molecules, 
10 E. coli were infected to yield approximately 100 plaques 
per plate. Replica filter lifts of the plaques on an agar 
plate were produced by overlaying a nitrocellulose filter 
that had been soaked in 10 mM isopropyl beta- 
dithiogalactopyranoside on each plate with transfer for 
15 15 hours at 23 'C. For detection of V HL antibody fragment 
expression, the filters were screened with rabbit anti- 
human heavy and light chain antibodies followed by goat 
anti-rabbit antibody coupled to alkaline phsophatase 
(Cappel Laboratories, Malver, Pennsylvania). The detec- 

2 0 tion of immunoreactive product confirmed the presence and 

expression of V HL antibody fragments. 

To identify human DNA clones expressing antibody that 
bound TT, plaques were plated and proteins expressed as 
described above. Replica filters were incubated with 0.2 
25 nN 125 I-tetanus toxoid and washed. Positive plaques were 
identified by autoradiography and isolated. The frequency 
of positive clones in the library was equivalent to 
(number of positive clones) / [number of plaques screened) 
X (fraction of plaques expressing V HL ) . Concentrated non- 
30 adsorbed tetanus toxoid was iodinated with sodium iodide 

ii.. . "tsua. ' - ' 

Rockford, Illinois) . 

3 5 Human DNA clones were re-plated at approximately 100 

..v,-,„ n ^ov- ~i aT: o eridr bv - i io with tho nr^.tn 1 rhnn^ *-b^^ 
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were used as templates for PCR amplification and screened 
in the primary antigen binding screen. The results of the 
screening procedure are seen in Figure 28. Similar 
signals between the parental clones and the V HL dicistronic 
5 DNA molecules demonstrated that the sequence differences 
introduced with the C H 1' and V L primers did not adversely 
affect gene expression. Also, it should be noted in 
Figure 28 that a random parental clone that did not react 
with tetanus toxoid , 7G1, was unreactive before and after 
10 the PCR dicistronic fusion, as was the control ImmunoZAP 
H vector (IZ H) . 

(ii) Mouse 

Mouse antibody-producing plaques prepared in Example 
27 were screened for antibody expression with rabbit anti- 
15 mouse heavy and light chain antibody (Cappel Laboratories) 
as described above. 

3 1 . Characterization of Cloned Dicistronic V 1I L Repertoire 

in Expression Library 
A. Verification of Presence and Size of Cloned 
2 0 Dicistronic V | IL Repertoire 

Bacteriophage from purified reactive plaques prepared 
in Example 3 OB were converted to the plasmid format by in 
vivo excision with R408 helper phage according to manufac- 
turer' s protocol (Stratagene) and also described in Short 
25 et al., Nucl. Acids Res. , 16:7583-7600 (1988). In the in 
vivo excision protocol, the cloned insert from the 
ImmunoZAP H vector was converted into a phagemid vector to 
allow easy manipulation and sequencing. Briefly, phage 
plaques were cored from the agar plates and transferred to 
30 sterile mi crofnae i-nboc cmo ^ ■ rr 



ror excisions, 20U ui of the phage stock, 200 ul ot 
XLl-Blue cells (A^ = 1.00) and 1 ul of R408 helper phage 

35 n x 10 10 r>l acrue formina units Oofu> > were irmbata^ -> - 
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37 -c for 15 minutes. AFter a 4 hour incubation in Luria- 
Bertani (LB) broth and heating at 70*C for 20 minutes to 
heat kill the XLl-blue cells, the phagemids were re- 
infected into XLl-Blue cells and plated onto LB plates 
5 containing ampicillin. Double stranded DNA was prepared 
from the phagemid containing cells according to the meth- 
ods described by Holmes et al. f final- Biochem., 114:193, 
(1981). Clones were first screened for DNA inserts by 
restriction digests with Xhoi and Xbai. The detection of 
10 1390 base pair fragment on an agarose gel confirmed the 
presence of a V HL dicistronic molecule insert. 

B. se quencing of Plasm as from Expression Library 

Clones containing the putative V HL insert were 
seguenced using reverse transcriptase according to the 

15 general method described by Sanger et al., Proc. Natl. 
*n*A. sei.. USA . 74:5463-5467, (1977) and the specific 
modifications of this method provided in the manufac- 
turer's instructions in the AMV reverse transcriptase 35 S- 
dATP seguencing kit (Stratagene) . 

20 Nucleotide seguence analysis of several fusion clones 

indicated that the seguence of the fusion region was 
identical to that shown in Figure 22, proving that the 
clones were actually generated through a fusion PCR 
intermediate . 

25 c. Advantages of Fusion-PCR to Produce Dicistronic DNA 
Molecules 

PCR amplification can, therefore, be used to fuse 
seguences responsible for encoding subunits of a hetero- 
dimeric protein together into a single DNA fragment that 

there is only one heavy and light chain sequence to choose 
from, and thus the heavy: light pair is a "natural" pair. 
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However, if spleen, peripheral blood B-cell, or other 
lymphocyte mRNA is used as the source of template, the PCR 
fusion reaction to form a dicistronic DNA molecule can 
randomly pair heavy and light chains from different cells, 
5 producing a combinatorial library. In such a library, 
only a small fraction of the clones contain the original 
heavy and light chain pairs. This may not be a problem if 
the desired natural pair is well represented in the orig- 
inal B-cell population, as is the case with hyper immunized 
10 donors. However, if one wishes to find a naturally 
occurring rare specificity in a combinatorial library, one 
may have to screen a large number of clones. 

The fusion method presented here may offer a solution 
to the random combinatorial problem. If one begins with 
15 a very dilute population of B-cells (possibly in a medium 
that limits diffusion) , it may be possible for the dicis- 
tronic event to occur between naturally paired heavy and 
light chain sequences before significant mixing between B- 
cell RNA occurs. Thus, the fused heavy and light chain 
20 sequences would be the original pairs, and the resulting 
library would express predominantly the naturally occur- 
ring antibody specificities. Such a library would be 
highly preferable when rare natural specificities are 
sought . 

25 Another advantage to this method is that only one 

vector and one cloning step are necessary. This saves a 
substantial amount of time, resources, and effort. 
Moreover, the ease of the single PCR reaction greatly 
simplified the process of going from B-cell RNA to an E_t- 

30 coli library, making this approach a noteworthy alterna- 
tive to standard hybridoma technology. 

The foregoing is intended as illustrative n-r thp 

- : ; - • 4 i ■ c j :■ . i .: :,i ..... -.- t o u c ■ ■ . i. n t: i nveiviion . 
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Claims: 

1. A method of producing a nucleic acid vector 
encoding two or more desired genes, each from a family of 
genes, said genes being capable of together producing a 
5 characteristic that can be used to identify the vector 
encoding said desired genes from other vectors encoding 
other combinations of genes from said families of genes, 

which method comprises: 

a) randomly inserting into vectors one member from 

10 a first family of genes and one member from one or more 
other families of genes so that a population of vectors 
are created wherein each vector may contain one of the 
genes from said first gene family and one of the genes 
from each of said other gene families; 

15 b) identifying within said population of vectors a 

vector capable of detectably producing a desired charac- 
teristic resulting from the inclusion of one gene from 
said first gene family and one gene from each of said 
other gene families, and using said characteristic to 

20 distinguish the vector from other vectors within the 
population containing undesired combinations of gene 
members from said gene families. 

2. The method of claim 1 wherein said genes are 
inserted into a DNA vector at one or more integration 
2 5 sites, which method further comprises: 

a) preparing said vectors with one or more site- or 
region-specific recombination sequences; 

b) permitting, in the presence of one or more 
reagents facilitating said site- or region-specific 

30 recombination, a member of said first family of genes to 
combine* in a vertor with * ^^bp- ^ «^ ~* 



jie ;:ietnoa oi claim >- wnerem saia site- 
region-specific recombination site is recognized and acted 
35 on by flp recombinase. 
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4. The method of claim 2 wherein said site- or 
region-specific recombination site is recognized and acted 
on by ere recombinase. 

5. The method of claim 2 wherein said site- or 
5 region-specific recombination site is recognized and acted 

on by lambda integrase recombinase. 

6. The method of claim 2 wherein at least one of 
the vectors contains a sequence capable of being 
recognized and acted on by transposase. 

10 7. The method in claim 1 where said genes are 

inserted into a DNA vector at one or more integration 
sites, which method further comprises: 

a) cleaving said vector with one or more site- 
specific integration reagents; 

15 b) preparing the ends of genes from said first 

family of genes so that one end will ligate with an end of 
the vector cleaved by a first reagent and the other with 
an end of the vector cleaved by a second reagent; 

c) preparing the ends of said genes from said other 
20 gene families so that one end will ligate with an end of 

the vector cleaved by a third reagent and the other with 
an end of the vector cleaved by a fourth reagent; 

d) preparing at least one double stranded DNA 
linker fragment having one end ligatable to one end of 

25 said genes from said first family of genes and the other 
end ligatable to one end of genes from said other family 
of genes; 

e) mixinei said vprt-n-r ~onor- ,-,:-:> - - 



8. The method of claim 7 wherein said reagents are 
the same . 
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9. the method of claim 8 wherein said reagents are 
different* 

10. The method of claim 1, wherein said combination 
of genes is accomplished in vivo. 

5 11. A method of producing a host cell expressing two 

or more desired genes, each from a family of genes, said 
genes being capable of together producing a characteristic 
that can be used to identify the host cell expressing said 
desired genes from other host cells expressing other 
10 combinations of genes from said families of genes , which 

method comprises: 

a) randomly introducing into host cells one member 
from a first family of genes and one member from one or 
more other families of genes so that a population of host 

15 cells are created wherein each host cell may contain one 
of the genes from said first gene family and one of the 
genes from each of said other gene families; 

b) identifying within said population of host cells 
a host cell capable of detectably exhibiting a desired 

20 characteristic resulting from the inclusion of one gene 
from said first gene family and one gene from each of said 
other gene families, and using said characteristic to 
distinguish the host cell from other host cells within the 
population containing undesired combinations of gene 

25 members from said gene families. 

12. The method of claim 11 wherein said vectors are 
lambda bacteriophage vectors and the host cells are E. 
Coli . 



oeing capable of producing a characteristic that can be 
used to identify the vector encoding said genes from other 
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vectors encoding other members of the families of genes 
which method comprises: 

a) isolating a first population of vectors for 
which each member of said population may contain one 

5 member of a family of genes; 

b) inserting one member of a second family of genes 
into each of the vectors so that a population of vectors 
are created where each vector may contain one of the genes 
from said first family and one of the genes from said 

10 second family; 

c) identifying within said population of vectors a 
vector capable of producing a characteristic resulting 
from the inclusion of one gene from said first gene family 
and one gene from said second gene family, and using said 

15 characteristic to distinguish the vector from other 
vectors within the population containing other members of 
the gene families. 

14. A method of producing a nucleic acid vector 
encoding two or more genes belonging to families of genes, 
20 said genes being capable of producing a characteristic 
that can be used to identify the vector encoding said 
genes from other vectors encoding other members of the 
families of genes, which method comprises: 

a) isolating a first population of vectors, for 

2 5 which each member of said population may contain one 

member of a first family of genes and a nucleic acid site 
or region at which the population of vectors can be 
combined with a second population of vectors; 

b) isolating a second population of vectors, for 

3 0 which each member of said population may contain one 

member of a second family of genes and a nucleic acid site 

r, H • memhe: *~ Me - r ~ : ". anii . ■ jCMie., auu jut memDei ui 

3 5 the second family of genes may be combined and expressed 
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in each member of a diverse population of recombined 
vectors ; 

c) recombining populations of said first and second 
vectors and at said nucleic acid site or region thereby 

5 creating a diverse population of recombinant vectors each 
of which may express one member of the first family of 
genes and one member of the second family of genes; 

d) identifying within said population of 
recombinant vectors a vector capable of producing a 

10 characteristic resulting from the inclusion of one gene 
from each of said gene families. 

15. The method of claim 14 wherein said nucleic acid 
site is cleaved with site-specific reagent, which method 

further comprises: 
15 a) cleaving said first vector population with said 

reagent ? 

b) cleaving said second vector population with said 
reagent; 

c) mixing both vector populations together in a 
2 0 ligation mix and ligating the two populations. 

16. The method of claim 14 wherein said nucleic acid 
region is a homologous region capable of undergoing homo- 
logous recombination, which method further comprises 
inserting one or more members of said first and second 

2 5 populations into a single host capable of carrying out 
homologous recombination and allowing such homologous 
recombination to occur. 

17 . The method of claim 14 wherein said nucleic acid 



-i >"TP 



, . ;no u r . . i :: C LC '. . ■ u U U jl a L i o I U . . ■ . - ■ - MU i « - - ! S ■■ 

capable of carrying out site-specific recombination at 
said nucleic acid site and allowing said site-specif ic 

^-orrombi nation tn occur. 
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18. The method of claim 17 wherein said target site 
for site-specific recombination is of the family of sites 
selected from flp, lox, and gamma-delta. 

19. The method of any of claims 1, 11 , 13 or 14 
5 wherein said vectors are plasmid or cosmid vectors. 

20. The method of any of claims l f 11 , 13 or 14 
wherein said vectors are phage vectors. 

21. The method of any of claims 1, 13 or 14 wherein 
said vectors are lambda bacteriophage vectors. 

10 22. The method of claim 14 wherein the identifica- 

tion of a particular vector within the recombinant vector 
population involves the interaction of sequence-specific 
nucleic acids with genes from said first and second 
families of genes. 

15 23. The method of claim 14 wherein the identifica- 

tion of a particular vector within the recombinant vector 
population involves the hybridization of nucleic acid 
probes with genes from said first and second of families 
of genes. 

20 24. The method of claim 14 wherein the identifica- 

tion of a particular vector within the recombinant vector 
population involves the expression of one or both of genes 
from said gene families as an RNA molecule. 

25. The method of claim 14 wherein the identifica- 

25 tion of a particular vertor within +-bp mnnwh^^*- 
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26. The method of claim 25 wherein the protein 
molecule (s) contains a binding site for another molecule. 

27. The method of claim 26 wherein the protein 
molecule (s) contains an epitope recognized by an antibody. 

5 28. The method of claim 27 wherein the protein 

molecule (s) contains an immune molecule binding site for 
an epitope. 

29. The method of claims 14 wherein both genes 
express an RNA and/or polypeptide and said RNAs and/or 

10 polypeptides physically interact within a host to create 
said characteristic. 

30. The method of claim 29 wherein both genes 
express polypeptides that physically interact to form a 
neo-epitope recognized by an immune molecule. 

15 31. The method of claim 29 wherein both genes 

express polypeptides that physically interact to form a 
binding site for another molecule. 

32. The method of claim 31 wherein the polypeptides 
are derived from antibody genes such that the interaction 

20 of both polypeptides forms an antigen binding site. 

33. The method of any of claims 1, 11, 13 or 14 
wherein the vectors contain a single promoter that 
expresses the genes from said gene families. 

34. The method of any of claims 1, 11, 13 or 14 



35. The method of claim 11 wherein the host is a 
mammalian cell. 
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36. The method of claim 11 wherein the host is a 
eukaryotic cell. 

37. The method of claim 11 wherein the host is a 
prokaryotic cell. 

5 38. The method of any of claims 1, 11 , 13 or 14 

wherein there are more than two gene f snail ies and the 
vectors produced contain a random assortment of one member 
of each gene family needed to create said characteristic. 

3 9 . HCFLP . 

10 40. LCFLP . 

41. A method of producing a biological agent having 
a desired phenotype wherein said phenotype results from 
expression of a particular combined nucleotide sequence 
and wherein said phenotype can be used to identify the 
15 biological agent having the particular combined nucleotide 
sequence which comprises: 

(a) bringing together a first population of 
nucleotide sequences with one or more other populations of 
nucleotide sequences to produce combined nucleotide 

2 0 sequences wherein each separate combined nucleotide 
sequence comprises one member of each population of 
nucleotide sequences; 

(b) expressing said combined nucleotide sequences in 
biological agents; and 

25 (c) identifying those biological agents which 

express said desired phenotype. 



; iH.L:a, .i^eiiLt udvin^ jinei comDineu nucleotide 

3 0 sequences further comprising using said phenotype to 
distinguish those biological agents expressina ^hp 
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particular combined nucleotide sequence from biological 
agents having other combined nucleotide sequences. 

43. A method according to claim 41 wherein said 
biological agent is a cell. 

5 44. A method according to claim 41 wherein said 

biological agent is nucleic acid vector. 

45. A method according to claim 41 wherein said 
biological agent is a bacteriophage or virus. 

46. A method according to claim 41 wherein said 
10 phenotype results from expression of a hybrid polypeptide 

which is encoded by the particular combined nucleotide 
sequence and is encoded at least in part by one nucleotide 
sequence from each population of nucleotide sequences 
which was brought together. 

15 47. A method according to claim 41 wherein said 

phenotype results from expression of a plurality of 
polypeptides wherein a polypeptide is encoded at least in 
part by one nucleotide sequence from each separate 
population of nucelotide sequences which was brought 

2 0 together. 

48. A method according to claim 41 wherein two 
populations of nucleotide sequences are combined. 

49. A method according to claim 47 wherein said 
phenotype results from expression of a heterodimeric 

_ ^ . -juli.wL. ,:..ne: •■uDun.. 

or said diner is encoded at least in part by the nucleic 
sequence from the second population of nucleotide 

^ ^ r* p rjn \ c* rj C ° 0 
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50, A method according to claim 48 wherein said 
phenotype results from expression of a first polypeptide 
encoded at least in part by the nucleotide sequence from 
the first population of nucleotide sequences and of a 

5 second polypeptide encoded at least in part by the 
nucleotide sequence from the second population of 
nucleotide sequences. 

51. A method according to claim 48 wherein said 
phenotype results from expression of an RNA molecule 

10 encoded at least in part by the nucleotide sequence from 
the first population of nucleotide sequences and a second 
RNA molecule encoded at least in part by the nucleotide 
sequence from the second population of nucleotide 
sequences . 

15 52. A method according to claim 48 wherein said 

phenotype results from synthesis of an RNA molecule 
encoded at least in part by the nucleotide sequence from 
the first population of nucleic acid sequences and by the 
nucleic acid sequence from the second population of 

20 nucleic acid sequences. 

53. A method according to claim 48 wherein the first 
and second populations of nucleotide sequences are 
combined by co-infection or co-transformation of host 
cells. 

25 54. A method according to claim 48 wherein members 

from said first and second populations of nucleotide 
sequences are combined randomly to give combined 

nucleotide secmence? 

.onujininy ^ saia populations oi nucleotide sequences 
gives a combined nucleotide sequence which was not 
nrevi ousl v expressed in said bioloaica! aaent . 
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56. A method according to claim 41 wherein said 
desired phenotype comprises a phenotype which was not 
previously expressed in a population of such biological 
agents . 

5 57. A method according to claim 41 wherein said 

first population of nucleotide sequences comprises non- 
identical nucleotide sequences. 

58. A method according to claim 41 wherein each 
population of nucleotide sequences comprises non- identical 

10 nucleotide sequences. 

59. A method of producing a nucleic acid vector 
encoding a preselected combined nucleotide sequence which 
comprises two or more preselected nucleotide sequences, 
each independently selected from a population of nucleo- 

15 tide sequences, said combined nucleotide sequence being 
capable of producing a characteristic that can be used to 
identify the vector encoding said preselected combined 
nucleic sequence comprises 

(a) bringing together a member nucleotide sequence 
20 from each population of nucleotide sequences to give a 

population of combined nucleotide sequences wherein each 
combined nucleotide sequence comprises a nucleotide 
sequence from each population; 

(b) inserting into vector a member of the population 
25 combined nucleotide sequences so that a population of 

vectors is created wherein each vector may contain a 
combined nucleic acid sequence? 

(c) identifying within said population of vectors, 

j .. r - r i ^ v ^ ..a -. i -« • - >■ 3 : - -■ * ■ 
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60. A method according to claim 59 wherein said 
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encoding the preselected combined nucleotide sequence from 
other vectors encoding other combinations of nucleotide 
sequences further comprising using said characteristic to 
distinguish the vector from other different vectors within 
5 the population having unselected combined nucleotide 
sequences . 

61. A method according to claim 60 wherein said 
nucleotide sequences are combined randomly. 

62 . A method according to claim 61 wherein said 
10 combined nucleotide sequences are produced using fusion 

polynucleotide amplification. 

63 . A method according to claim 59 wherein said 
combined nucleotide sequences are produced using fusion 
polynucleotide amplification. 

15 64. A method according to claim 1 wherein a 

dicistronic or multicistronic DNA sequence which comprises 
one member from the first family of genes and one member 
from one or more than families of genes which comprises a 
random combination of said members of said families of 

20 genes is synthesized using fusion polynucleotide 
amplification and inserted into vectors. 

65. A method for producing a biological agent having 
a desired novel phenotype wherein said phenotype results 
from expression of a particular combined nucleotide 
25 sequence and wherein said phenotype can be used to 
identify the biological agent having % the particular 
combined niirl eotidp ^o^^^^^o • - ^ -«^ r -.- : - 



,;iutdtions Lu occur m eitner nucleotide sequence to 
generate a population of diverse replicas of each parent 
nucleotide secruence ; 



SUBSTITUTE SHEET 



WO 91/16427 



PCT/US91/02910 



163 

(b) randomly bringing together the populations of 
diverse replicas to produce combined nucleotide sequences 
wherein each combined nucleotide sequence comprises one 
member of each population of diverse replicas; 
5 (c) expressing said combined nucleotide sequences in 

biological agents ; and 

(d) identifying those biological agents which 

express said desired phenotype. 

66. A method according to claim 65 wherein said 
10 desired phenotype is distinguishable from phenotypes 

expressed by said parent nucleotide sequences. 

67. A method according to claim 66 wherein said 
phenotype can be used to distinguish it from biological 
agents having other combined nucleotide sequences using 

15 said phenotype to distinguish those biological agents 
expressing the particular combined nucleotide sequence 
from biological agents having other combined nucleotide 
sequences. 

68. A method according to claim 65 wherein the 
20 parent nucleotide sequences comprise a single DNA molecule 

and are replicated together; further comprising separating 
the populations of diverse replicas of each parent 
nucleotide sequence prior to bringing together step (b) . 

69 . A method according to claim 68 which comprises 
25 replicating two parent nucleotide sequences. 

70. A method according to claim 65 wherein the 



l ep^icat-ing two parent nucleotide sequences. 
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72. A method according to 71 wherein a first parent 
nucleotide sequence is replicated in one population of 
cells and a second parent nucleotide sequence is repli- 
cated in a second population of cells and said cell 

5 populations are mixed and fused to generate cells which 
express combined nucleotide sequences. 

73. A method according to claim 72 wherein said 
first parent nucleotide sequences codes for a selected V L 
and said second parent nucleotide sequences codes for a 

10 selected V H , said cells are E. coli; and said combined 
nucleotide sequences express a Fab. 

74. A method for producing a biological agent having 
a desired phenotype wherein said phenotype results from 
expression of a particular combined nucleotide sequence 

15 and wherein said phenotype can be used to identify the 
biological agent having the particular combined nucleotide 
sequence which comprises: 

(a) replicating parent populations of nucleic acid 
sequences to generate a population of diverse replicas of 

20 each parent population: 

(b) randomly bringing together the populations of 
diverse replicas to produce combined nucleotide sequences 
wherein each combined nucleotide sequence comprises one 
member of each population of diverse replicas; 

25 (c) expressing said combined nucleotide sequences in 

biological agents; and 

(d) identifying those biological agents which 
express said desired phenotype. 



sequences . 
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76. A method according to claim 75 wherein said 
phenotype can be used to distinguish said biological agent 
from biological agents having other combined nucleotide 
sequences, further comprising using said phenotype to 
5 distinguish those biological agents expressing the 
particular combined nucleotide sequence from biological 
agents having other combined nucleotide sequences. 
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FIG. 5-1 

Subclass I (A) 
#L39 

CTCGAGTCAGGACCTGGCCTCGTGAAACCTTCTCAGTCTCTGTCTCTC 
ACCT GCTCTGTCACTGGCTACTCCATCAC CAGSS^S2&22A CIgGAAC 

TGGATCCGGCAGTT 

Subclass II (A) 
#L11 

CTCGAGTCTGGGCCTnAACTGGCAAAACCTGGGGCCTCAGTGAAGATG 
T ^r» T ^^aa^^r^r^r:r:rrAr&rrTTGAgT AGTTACTGGATACACT GG 

GTAAAAnAG AGG CC 
#L03 

CTCGAGTCTGGACCTnAGCTGGTAAAGCCTGGGGTTCAGTGAAGATGT 
^r^rrnTvar^r^r^^A^ArArATTnA^AGCTATGTTATACACTGGG 

TGAAGCAGAAGCCT 

FIG. 5-2 

#L32 

CTCGAGTCTGGACCTGAACTGGTAAAGCCTGGGACTTCAGTGAAGATG 
TCPTGgAAGGCTTCTGGATACACATTCACC AGCTATGTTATGCGCT GG 

GTG AAGCAGAAGC C 



Subclass II (B) 
#L37 

CTCGAGTCAGGGGCTGAACTGGTGAAGCCTGGGGTTTCAGTGAAGTTG 
TfTfTTGPA AG G CTTCTGG CT ACAC CTT CAC nAG CTA CTAT ATGT ACT GG 

GTGAAGCAGAGGCr 



CTCGAGTCTGGGGCTAAGCTGGTAAGGCCTGGAGCTTnAGTnAAGCTG 
TrrTGnAGGGCTTCTGGCTACTCCTTCACn AGCTACTGGATGAAC TGG 

-» i-n /-» n -* rt * <"* 

, ^ /-V v.- ^3 v.... 
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FIG. 5-3 

Subclass II (C) 
#L33 

CTCGAGTCTGGGGCTGAGCTGGTGAGGCCTGGAGCTTCAGTnAAGCTG 
^CTG^GGCCrrCTCGTACTCCTTCACCAGCTCCTGATAACTGGGTG 

AAGCAGAGGCCTGG 

Subclass III (B) 
#L36 

CTCGAGTCAGGAGGTGGCCTGGTGCAGCCTGGAGGAT^ 
TCCTGTGCAGCCTCAGGATTCGATTTnAGn A G A T A CTGG ATGAATTGG 

GTCCGGCAGCTCCA 



#L02 

CTCGAGTCTGGAGGTGGCCTGGTGCAGCCTGGAGGATCCCTGAATCTC 
CCCTCTC CJ.CCCTffL'^A TT^P tTWTWT>w ar -" tr ' tTR ATGGATGAGTTGG 
GTTCGGCAGGCTCC 

FIG. 5-4 

#L31 

CTCGAGTCTGGAGGTGGCCTGGTGCAGCCTGGAGGATCCCTGAAAGTC 

TCCTCTCrA^^^^ a ^ a ^^^' T ^^^^ R ^ ATAOTGGATGAG ^ GG 
GTCCGGCAGCTCCA 



#L34 

CTCGAGTCTGGAGGTGGCCTGGTGCAGCCTGGAGGATCCCTCAAACTC 
TCCTCTG ^r-r*r*r r r*r.r.i'wrr, RTTTn ft Gn AG ATACTGG ATG AGTTGG 
GTCCGGCAGCTCCA 



CTCGAGTCAGGAGGTGGCCTGGTGCAGCCTGGAGGAGCCCTGAAACTC 
TCCTCTCCA ^^r^riir:narrrrn&T^TnAGnAGATACTGGATGAGTTGG 

GTCCGCAGCTCCAG 
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FIG. 5-5 

Subclass III (C) 
#L10 

CTCGAGTCTGGGGGAGGCTTAGTnCAGCCTGGAG^TCCCGGAA^C 

TCCTGTGCAGCCTCTGGATTCACOT 
ATTCGTCAGGCTCC 

#L08 

CTCGAGTCTGGGGGAGGCTTAGTnnAGCCTGGAGGGTCCCGGAAACTC 

T^CTGTGCAGCCTCTGGATTCACTTTnAGliAG^ 

GTTACGTCAGGCTC 



Subclass V (A) 
#L38 

CTCGAGTCAGGGGCTGAACTGGTGAGGCCTGGGCGTTCAGTi^GATG 
T CCTG CJy^GG CTTCAGG CTATT C CTT CACC A G fT A CTGG AT G CACTGG 
GTGAAACAGAGGCC 

FIG. 5-6 

Miscellaneous 
#47 

CTCGAGTCAGGGGCTGAACTGGCAAAACCTGGGGCCTCAGTAAAGATG 
TCCTGCAAGGCTTCTGGCTACACCTCTTCTTCCTTCTGGCTGCACTGG 

AT AAAAG AAGG C CT 
#L35 

CTCGAGTCTGGACCTnAGCTGGTGAAGCCTGGGG TTCA GTTAAAATAT 
CCTGCAAGGCTTCTGGTTACTCATTTTCTnTCTACTTTGTGAACTGGG 

TGATGCAGAGCCAT 



CTCGAGTCAGGGGCTGAACTGGTGAAGCCTGGGGTTCAGTAAGTTGTC 
CTGAAGGCTTCTGGCTACACCTTCACCGGCTACTATATGTACTGGGTG 

AAGCAGAGGCCTGG 
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FIG. 6A-1 



y u EXPRESSIO N VECTOR: 
-Hi - 



SHINE-DALGARNO MET 

■tr— i 



GGCCGCAAATTCTATTTCAAGGAGACAGTCATAATG 
CGTTTAAGATAAAGTTCCTcrtiTCAGTAxxiiv. 



LEADER SEQUENCE 

'AAATACCTATTGCCTACGGCAGCCGCT 
TTTATGGATAACGGATGCCGTCGGCGA 



LEADER SEQUENCE 

GGATTGTTATTACT CGCTGCCCAACCAG 
CCTAACAATAATGAGCGACGGGTTGGTC 



FIG. 6A-2 



LINKER 



NCOI 



V H BACKBONE 



LINKER 



XHOI 



SPEI 



CCATGGCCCAGGTGAAACTGCTCGAG'ATTTCTAGACTAGT 
GGTAC CGGGT C CACTTTG ACG AG CT CT AAAG AT CTGAT C A 



T^-u>- 0 i'vrAspvaa.i'roAscTyrAxa^ei 

TACCCGTACGACGTTCCGGACTACGGTTCTTAATAGAATTCG 
ATGGGCATG CT G CAAGG CCTGATG CCAAGAATTATCTT AAG CAG CT 
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FIG. 6B-1 



V, FYPKESSION VECTOR 



SHINE- DALGARNO 

GGCCGCAAATTCTATTTCAAGGAGACAGTCATA 
" CGTTTAAG ATAAAGTT CCT CTGTCAGTAT 



MET LEADER SEQUENCE 

ATGAAATACCTATTGCCTACGGCAGCCGCTGGA 
TACTTTATGGATAACGGATGCCGTCGGCGACCT 



LEADER SEQUENCE 



TTGTTATTACTCGCTGCCCAACCAG 
AACAATAATGAGCGACGGGTTGGTC 



FIG. 6B-2 



LINKER 



NCOI 



3. 



'H 



BACKBONE 



CCATGGCCCAGGTGAAACTG 
GGTACCGGGTCCACTTTGAC 



/OIU Jl 



X*' JO Jl 



CTCGAGAATTCTAGACTAGTTAATAG 

~AGCTCTTAAGATCTGAT CAATT AT C A G 
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FIG. 16. 
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FIG. 17. 
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INVIVO FLP MEDIATED RECOMBINATION 
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pe!B LEADER 

HEAVY CHAIN u k Y I I P T 

HOMOLOGY Spe I I Stops ribl m r t i l 



5> -6CCTACGG 

3' |ctcgggtttagaaca| tgatcaattatcgttcctctgtcagtattactttatggataacggatgcc 

HEAVY CHAIN DOWNSTREAM PRIMER (ChH 



LIGHT CHAIN 

A A A G L F F I A A Q P A M A Sac I HOMOLOGY 

C AGCC6CTGGATTGTT ATTAATC G CTGCCC A ACCTGC CATGGCTG A GCTC|GTG_MXACCC ACTC TCC-) 
GTCGGCGACCTAACAATAA- 5 ' LIGHT CHAIN UPSTREAM PRIMER (V L ) 



FIG. 22. 
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MODIFIED V H EXPRESSION VECTOR: 



Not I RIBOSOME BINDING SITE 
5' GAGCTGCGGCCGCAAATTCTATTTCAAGGAGACAGTWiTA 

3' re/VCCTCTTTA acatta a ifiTTOTTTCiriffTAT 

■ WWWW » » M w »vr» i« w «i »w i • ww i w • -»r t ■ 



Pel B LEADER 
MetLysTyrLeuLeuProThrAloAloAlaGlyLeiieuLeuLeiiAlo 

ATGAAATAttlATTGCCTACGGCAGCCGCTGGATTC 
TACTTTAmTAACGGATGCCGTCGGCGACCTAACAATAATGAGCGA 

FIG. 24a 

Ncol Xhol Xbol Spel 

AloGlnProAloMetAlaGlnVolGlnLeoLeuGkj Thr 
GimACCAGCCATGGCCCAimAGCTimAGATnCTAGACT 
CGGGTTGGTCGGTACCGGGTCCACGTCGACGAGCTCTAAAGATCTGA 



EcoRI 

SerTyrProVAspVolPraAspfyrGlySerStop 

AGH/«CCCTACGACGTTCCGGACTACGGTTCnAAlAGAATTCG 
TCAATGGGCATCCTGCAAGGCCTCATGCCAAGAATTATCTTAAGCAGCT 
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V H EXPRESSION VECTOR: 



Not I RIBOSOME BINDING SITE 

5' G6CCGCMATTCTATTTCAAGGAGACAG1CATA 
rmTAA^AAGntOT 



Pel B LEADER 

MetLysTyrLeuLeuProThrAloAlaAloGlyLeuLeuLeuLeuAlo 

ATGAAATAOTATOCTA(^CAGC(»CTGGATTGnAnACTCGCT 

TACTTTATGGATAACGGATGCCGTCGGCGACCTAACAATAATGAGCGA 

Ncol Xhol XbalSpel 

V H BACKBONE 

AloGlnProAloMetAloGlnVolLysLeuLeuGlu' Thr 
(XOWIDCACCWl^^ 

CGGGTTGGTCGGTACCGGGTCCAtTTTGACGAGCTCTAAAGATCTGA 



EcoRI 

SerTy rProTyrAspVol ProAspTyrGlySerStop 

AGTTACCCGTACGACGTTCCGGACTACGGTTCTTAATAGAATTCG 

TCAATGGGCATGCTGCAAGGCCTGATGCCAAGMTTATCTTAAGCAGCT 



FIG. 25a. 
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V L EXPRESSION VECTOR: 



EcoRi RiBOSOME BINDING SITE 

5' TGAATTCTAAACTAGTC GCCAA&GAGACAGTCATA 
3 1 TOMCIlMttT^^ 



Pel B LEADER 

Met LysTyrLeuleu ProThrAlo Ala AlaGty LeuLeuLeuLeu 
ATGAMTA(^TATTT5CCTACGGCAGCCGC7GGATTCnATTA(X 

TACnTATGWTAACGGATOTCGG(M)CTAACAAlAATGAG _ _ _ _ . 

FIG. 25b. 

Ncol Sad 

AloAlaGlnProAlaMetAloGluLeu 

GCTGCCCAACCAGCCATGGCCGAGCTC 

CGACGGGTTGGTCGGTACCGGCTCGAG 



Xbal 

Stop Stop 

GTCAGTTCTAGAGTTAAGCGGCCG 
C AGTC AAGATCTCA ATTCGCCGGCAGCT 
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